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FLAG GENE FAMILY AND TUMORIGENESIS 

The present invention relates to the identification of 
the FLAG gene family as a family of genes frequently 
5 associated with tumorigenesis . The invention in particular 
relates to the identification of one member of this gene 
family that is involved in benign tumors such as those with 
chromosome anomalies involving a particular region of the 
long arm of chromosome 8, for instance but not restricted to 
10 pleomorphic adenomas of the salivary glands with involvement 
of chromosome 8gl2 and a translocation partner chromosome, 
very frequently chromosome 3 [i.e. t (3 ;8) (p2l;ql2) ] • In 
addition, the invention relates to another member of this 
novel family as the gene whose expression is frequently 
15 abrogated in malignant tumors such as for example but not 
restricted to malignant salivary gland tumors. Furthermore, 
the invention concerns the identification of the CTNNBl gene 
as a prototype of tumor-specific breakpoint region genes and 
frequent fusion partner of the FLAG genes. The invention 
20 relates in particular to the use of the members of the FLAG 
gene family and corresponding fusion partner genes as well 
as their derivatives in diagnosis and therapy- 

Multiple independent cytogenetic studies have firmly 
implicated chromosome region 8qll-13 in the following tumor 
25 types: pleomorphic adenomas of the salivary glands (high 
frequency) ^ pleomorphic adenomas of the lacrimal glands 
(high frequency) , lipoblastomas (high frequency) , solitary 
lipomas (low frequency), rhabdomyosarcomas (low frequency), 
and renal cell carcinomas (low frequency) . As an 
3 0 illustration, we here focus on pleomorphic adenomas of the 
salivary gland. This type of tumor constitutes a benign 
epithelial tumor that originates from the major and minor 
salivary glands. It is the most common type of salivary 
gland tumor and accounts for almost 50% of all neoplasms in 
35 these organs; 85% of the tumors are found in the parotid 
gland, 10% in the minor salivary glands and 5% in the 
submandibular gland. About 50% of these adenomas appear to 
have a normal karyotype but cytogenetic studies have also 
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revealed recurrent specific chromosome anomalies. Frequently 
observed anomalies include aberrations of chromosome 8, 
usually involving the 8ql2 region with as most common 
aberration a t(3 ?8) (p21;ql2) , and aberrations of chromosome 
5 12, usually translocations involving region 12ql3-ql5. Non- 
recurrent clonal chromosome abnormalities have also been 
reported. The highly specific pattern of chromosome 
rearrangements with consistent breakpoints at 8ql2 and 
12ql3-ql5 suggests that these chromosomal regions harbour 
10 genes that might be implicated in the development of these 
tumors . 

In previous physical mapping studies, the chromosome 
8ql2 breakpoint in pleomorphic adenoma of the salivary 
glands was mapped between MOS and PENK . in an effort to 

15 molecula'riv i^lnno -hka >vAnn -.-^^^—^-^ • ^^.t. _ 

J u^^Bi^ucsu uy une cnromosome 8ql2 

aberrations in the various tumors, the present inventors 
chose directional chromosome walking as a structural appro- 
ach to define the DNA region encompassing these breakpoints. 
As a starting point for chromosome walking, the MPS 

20 locus was used. During these walking studies, YAC clones 

corresponding to eight different loci in 8qll-i2 were isola- 
ted and mapped by FISH analysis. Initially, the breakpoints 
were mapped within a 1 Mb region flanked by MOS proximal ly 
and by the genetic marker D8S166 distal ly. one YAC (CEPH 

25 166F4) within this region was shown to span the t(3;8) 

breakpoint in two tumors. Subsequently, it was shown by FISH 
that the chromosomal breakpoints as present in a number of 
primary pleomorphic adenomas were clustered within a small 
chromosomal segment which has been designated pleomorphic 

3 0 adenoma breakpoint cluster region (PA-BCR) . 

It has thus been found that essentially all breakpoints 
of chromosome 8ql2 map in a region indicated herein as PA- 
BCR. Further research revealed that in this region a member 
of the FLAG gene family, the PLAGl gene, can be identified 

35 as the breakpoint target gene and postulated tumor aberrant 
growth gene. The PIAg gene family is a subclass of the zinc 
finger gene superfamily. The CTNNBl gene was identified as 
the fusion partner gene of PLAGl. Molecular analysis of the 
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chromosome breakpoints revealed that the chromosome translo- 
cations resulted in gene fusions. Fusions appeared to occur 
in the 5 • -non-coding regions of both genes f PLAGl and CTNN- 
Bl) , exchanging regulatory control elements while preserving 
5 the coding sequences. Due to the t (3 ; 8) (p21;ql2) , PLAGl is 
activated and expression of CTNNBl is down-regulated. Acti- 
vation of PLAGl was also observed in adenomas with variant 
translocations, i.e. t(5;8) and t(8;15). The results of the 
present inventors indicate that PLAGl activation due to 

10 promoter swapping is a crucial event in salivary gland 
tumor igenes is , 

The preferential fusion partner of PLAGl is the CTNNBl 
gene, which encodes B-catenin, a cytoplasmic protein of 
about 88 kD. Its frequent involvement in pleomorphic adeno- 

15 mas point towards a critical role of the CTNNBl gene. The 
major role of CTNNBl may be to provide an active promoter in 
front of the PLAGl gene. However, since the observed promo- 
ter swapping between PLAGl and CTNNBl leads to down-regula- 
tion of CTNNBl expression, it may also lead to other physio- 

2 0 logical consequences, especially since fi-catenin is a pro- 

tein that has been implicated in highly diverse processes, 
such as human colon cancer, epithelial cell adhesion, embry- 
onal axis formation in Xenopus ^ and pattern formation in 
DrosoDhlla. 

25 According to the present invention the aberrations in 

the PLAGl gene on chromosome 8 and the CTNNBl gene on chro- 
mosome 3 have been used as a model to reveal the more gene- 
ral concept of the involvement of members of these gene 
families in tumorigenesis . Although both gene families are 

3 0 known per se, up till the present invention the correlation 

between these families and tumor inducing chromosome aberra- 
tions, like translocations, deletions, insertions and inver- 
sions, has not been anticipated. Furthermore, until now, it 
was not previously demonstrated that alterations in the 
35 physiological expression level of the members of the gene 
family are probably also implicated in tumor development. 
According to the invention, it was demonstrated that in 
normal adult cells the expression level of the PLAGl gene is 
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practically undetectable, whereas in aberrantly growing 
cells, the expression level is significantly increased. It 
appears that PLAGl is a oncogene. 

The research that led to the present invention 
5 indicates that it is probable that abberations in the PLAGl 
gene, either in its sequence and/or its expression, will 
usually lead to a benign tumor. Furthermore, preliminary 
research indicated that expression of PLAGl is increased in 
various types of malignant salivary gland tumors. 

10 Another member of the PLAG gene family is the PliAG2 

gene. The present inventors have identified this gene, 
determined its nucleotide sequence and predicted its amino 
acid sequence. It was found that PLAG2 mapped to a region 
frequently deleted in malignant salivary gland tumors 

15 (6q24) . Thus- contrary to PLAGl . PLAG 2 may be a 
tumorsuppressor gene. 

The present invention now provides for a tool to 
investigate these theories and may ultimately lead to a 
means for distinguishing between benign and malignant 

20 tumors. This knowledge can then lead to more efficient 

methods of treatment. Thus, the present invention now provi- 
des for the members of the gene families or derivatives 
thereof in isolated form and their use in diagnostic and 
therapeutic applications. Furthermore, the knowledge on the 

25 location and nucleotide sequence of the genes may be used to 
study their rearrangements or expression and to identify a 
possible increase or decrease in their expression level and 
the effects thereof on cell growth. Based on this informati- 
on diagnostic tests or therapeutic treatments may be desig- 



In this application, " PLAG " will be used to indicate 
the involvement of these types of genes in various types of 
tumors, not necessarily restricted to pleomorphic adenomas 
of the salivary glands. The term refers to all members of 
35 the PLAG gene family involved in non-physiological 

proliferative growth, and in particular involved in benign 
or malignant tximors. Members of the PLAG gene family show 
homology between zinc finger domains that are typical for 



3 0 ned . 
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the PLAGl gene. However, according to the invention it was 
furthermore found that even breaks outside the actual genes 
but in the vicinity thereof might result in aberrant growth. 
The term " PIAG gene" is therefore also intended to include 
5 the immediate vicinity of the gene. The skilled person will 
understand that the "immediate vicinity" should be 
understood to include the surroundings of the gene in which 
the breaks or mutations will result in the above-defined 
non-physiological proliferative growth. 
10 The term "Tumorigenesis gene" or "T-gene" will be used 

to indicate all members of this novel PLAG gene family and 
their corresponding translocation or fusion partners, like 
CTNNBl . 

The term "wildtype cell" is used to indicate the cell 
15 not harbouring an aberrant chromosome. "Wildtype" or "nor- 
mal" chromosome refers to a non-aberrant chromosome. 

The present invention provides for various diagnostic 
and therapeutic applications that are based on the informa- 
tion that may be derived from the genes. This information 
20 not only encompasses its nucleotide sequence or the amino 

acod sequence of the gene product derived from the gene, but 
also involves the levels of transcription or translation of 
the gene. 

The invention is thus two-fold. On the one hand the 
25 aberration in cell growth may be directly or indirectly 

caused by the physical breaks that occur in the gene or its 
vicinity. On the other hand the aberration in cell growth 
may be caused by a non-physiological expression level of the 
gene. This non-physiological expression level may be caused 
3 0 by the break, or may be due to another stimulus that activa- 
tes or deactivate the gene. At present, the exact mechanism 
or origin of the aberrant cell growth is not yet completely 
unraveled. However, exact knowledge at this time is not 
necessary to define methods of diagnosis or treatment. 
35 Diagnostic methods according to the invention are thus 

based on the fact that an aberration in a chromosome results 
in a detectable alteration in the chromosomes' appearance or 
biochemical behaviour. A translocation, for example will 
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result in a first part of the chromosome (and consequently 
of a FLAG gene) having been substituted for another (second) 
part (further referred to as "first and second substitution 
parts"). The first part will often appear someplace else on 
5 another chromosome from which the second part originates. As 
a consequence hybrids will be formed between the remaining 
parts of both (or in cases of triple translocations, even 
more) chromosomes and the substitution parts provided by 
their translocation partners. Since it has now been found 

10 that the breaks occur in a FLAG gene, this will result in 
hybrid gene products of that FLAG gene. Markers, such as 
hybridising molecules like RNA, DNA or DNA/RNA hybrids, or 
antibodies will be able to detect such hybrids, both on the 
DNA level, and on the RNA or protein level. 

15 For example, the transcript of a hybrid will still 

comprise the region provided by the remaining part of the 
gene /chromosome but will miss the region provided by the 
substitution part that has been translocated. In the case of 
inversions, deletions and insertions the gene may be equally 

20 afflicted. 

Translocations are usually also cytogenetically detec- 
table. The other aberrations are more difficult to find 
because they are often not visible on a cytogenetical level. 
The invention now provides possibilities for diagnosing all 

25 these types of chromosomal aberrations. 

In translocations markers or probes based on the FLAG 
gene for the remaining and substitution parts of a chromoso- 
me in sjtu detect the remaining part on the original chromo- 
some but the substitution part on another, the translocation 

3 0 partner • 

In the case of inversions for example, two probes will 
hybridise at a specific distance in the wildtype gene. This 
distance might however change due to an inversion. In situ 
such inversion may thus be visualized by labeling a set of 
3 5 suitable probes with the same or different detectable mar- 
kers, such as fluorescent labels. Deletions and insertions 
may be detected in a similar manner. 
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According to the invention the above in situ applicati- 
ons can very advantageously be performed by using FISH 
techniques. The markers are e.g. two cosmids one of which 
comprises exon 1 and the upstream region of the FLAG gene, 
5 while the other comprises the last exon and its downstream 
region. Both cosmids are labeled with different fluorescent 
markers, e.g. blue and yellow. The normal chromosome will 
show a combination of both labels, thus giving a green 
signal, while the translocation is visible as a blue signal 

10 on the remaining part of one chromosome (e.g. 8) while the 
yellow signal is found on another chromosome comprising the 
substitution part. In case the same labels are used for both 
probes, the intensity of the signal on the normal chromosome 
will be 100%, while the signal on the aberrant chromosomes 

15 is 50%. In the case of inversions one of the signals shifts 
from one place on the normal chromosome to another on the 
aberrant one. 

In the above applications a reference must be included 
for comparison. Usually only one of the two chromosomes is 

20 afflicted. It will thus be very convenient to use the normal 
chromosome as an internal reference. Furthermore it is 
important to select one of the markers on the remaining or 
unchanging part of the chromosome and the other on the 
substitution or inverted part. In the case of the PIAGl gene 

25 of chromosome 8, breaks are usually found in the 5'-UTR as 
is shown by the present invention. Probes based on the 5'- 
and 3»-UTR are thus very useful. As an alternative a combi- 
nation of probes based on both translocation or fusion 
partners may be used, e.g. 5»-UTR of PLAGl and 3»-UTR of 

30 CTNNBl . 

"Probes" as used herein should be widely interpreted 
and include but are not limited to linear DNA or RNA 
strands. Yeast Artificial Chromosomes (YACs) , or circular 
DNA forms, such as plasmids, phages, cosmids etc.. 
35 These in situ methods may be used on metaphase and 

interphase chromosomes. 

Besides the above described in situ methods various 
diagnostic techniques may be performed on a more biochemical 
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level, for example based on alterations in the DNA, RNA or 
protein or on changes in the physiological expression level 
of the gene. 

Basis for the methods that are based on alterations in 
5 the chromosome's biochemical behavior is the fact that by 
choosing suitable probes, variations in the length or compo- 
sition in the gene, transcript or protein may be detected on 
a gel or blot. Variations in length are visible because the 
normal gene, transcript (s) or prcftein(s) will appear in 

10 another place on the gel or blot then the aberrant one(s) . 
In case of a translocation more than the normal number of 
spots will appear. 

Based on the above principles the invention thus rela- 
tes to a method of diagnosing cells having a non-physiologi- 

15 cal proliferative capacity, comprising the steps of taking a 
biopsy of the cells to be diagnosed, isolating a suitable 
PIAG gene-related macromolecule therefrom, and analysing the 
macromolecule thus obtained by comparison with a reference 
molecule originating from cells not showing a non-physiolo- 

20 gical proliferative capacity, preferably from the same 

individual. The PLAG gene-related macromolecule may thus be 
a DNA, an RNA or a protein. The FLAG gene may be either a 
member of the EL&Gl family or of the translocation partner 
gene family of which the CTNNBi gene is the prototype. 

2^ a specific embodiment the diagnostic method of the 

invention comprises the steps of taking a biopsy of the 
cells to be diagnosed, extracting total RNA thereof, prepa- 
ring a first strand cDNA of the mRNA species in the total 
RNA extract or poly-A-selected fraction (s) thereof, which 

30 CDNA comprises a suitable tail; performing a PGR using a 
EIAG gene-specific primer and a tail-specific primer in 
order to amplify pi^G gene-specific cDNA's; separating the 
PGR products on a gel to obtain a pattern of bands; evalua- 
ting the presence of aberrant bands by comparison to wildty- 

35 pe bands, preferably originating from the same individual. 

As an alternative amplification may be performed by 
means of the Nucleic Acid Sequence-Based Amplification 
(NASBA) technique [Compton, J. (1991) Nucleic acid sequence- 
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based amplification. Nature 350,91-92) or variations 
thereof . 

In another embodiment the method comprises the steps of 
taking a biopsy of the tumor to obtain cells to be 
5 diagnosed, isolating total protein therefrom, separating the 
total protein on a gel to obtain essentially individual 
bands, optionally transfering the bands to a Western blot, 
hybridising the bands thus obtained with antibodies directed 
against a part of the protein encoded by the remaining part 

10 of the FLAG gene and against a part of the protein encoded 
by the substitution part of the FLAG gene; visualising the 
antigen-antibody reactions and establishing the presence of 
aberrant bands by comparison with bands from wildtype 
proteins, preferably originating from the same individual. 

15 In a further embodiment the method comprises taking a 

biopsy of the tumor to obtain cells to be diagnosed; 
isolating total DNA therefrom; digesting the DNA with one or 
more so-called "rare cutter" (typically "6- or more 
cutters") restriction enzymes; separating the digest thus 

20 prepared on a gel to obtain a separation pattern; optionally 
transfering the separation pattern to a Southern blot; 
hybridising the separation pattern in the gel or on the blot 
with a set of probes under hybridising conditions; 
visualising the hybridisations and establishing the presence 

25 of aberrant bands by comparison to wildtype bands, 
preferably originating from the same individual* 

Changes in the expression level of the gene may be 
detected by measuring mRNA levels or protein levels by means 
of a suitable probe. 

3 0 Diagnostic methods based on abnormal expression levels 

of the gene may comprise the steps of taking a sample of the 
tumor to obtain cells to be diagnosed; isolating mRNA 
therefrom; and establishing the presence and/or the 
(relative) quantity of mRNA transcribed from the FLAG gene 
35 of interest in comparison to a control. Establishing the 
presence or (relative) quantity of the mRNA may be achieved 
by amplifying at least part of the mRNA of the FLAG gene by 
means of RT-FCR or similar amplification techniques. In an 
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alternative embodiment the expression level may be 
established by determination of the presence or the amount 
of the gene product (e.g. protein) by means of for example 
monoclonal antibodies. 
5 The diagnostic methods of the invention may be used for 

diseases wherein cells having a non-physiological prolifera- 
tive capacity are selected from the group consisting of 
benign tumors, such as the tumors pleomorphic adenomas of 
the salivary glands, lipoblastomas, uterine leiomyomas, and 
10 other benign tumors as well as various malignant tumors, 
including but not limited to sarcomas (e.g. rhabdomyosarco- 
ma) and leukemias and lymphomas. 

As already indicated above, it has been found that in 
certain malignant tumors the expression level of the plag 
15 genes is increased. Another aspect of the invention thus 
relates to the implementation of the identification of the 
Eli^ genes in therapy. The invention for example provides 
anti-sense molecules or expression inhibitors of the flag 
gene for use in the treatment of diseases involving cells 
20 having a non-physiological proliferative capacity by modula- 
ting the expression of the gene. 

The invention thus provides derivatives of the flag 
gene and/or its immediate environment for use in diagnosis 
and the preparation of therapeutical compositions, wherein 
25 the derivatives are selected from the group consisting of 
sense and anti-sense cDNA or fragments thereof, transcripts 
of the gene or fragments thereof, antisense RNA, triple 
helix inducing molecule or other types of "transcription 
clamps", fragments of the gene or its complementary strand, 
30 proteins encoded by the gene or fragments thereof, protein 
nucleic acids (PNA) , antibodies directed to the gene, the 
CDNA, the transcript, the protein or the fragments thereof, 
as well as antibody fragments. Besides the use of direct 
derivatives of the genes and their surroundings (flanking 
35 sequences) in diagnosis and therapy, other molecules, like 
expression inhibitors or expression enhancers, may be used 
for therapeutic treatment according to the invention. An 
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example of this type of molecules are ribozymes that destroy 
RNA molecules. 

Bsldes the above-described therapeutic and diagnostic 
methods, the principles of the invention may also be used 
5 for producing a transgenic animal model for testing pharma- 
ceuticals for treatment of ELAG-related malignant or benign 
tumors- One of the examples describes the production of such 
an animal model. 

It is to be understood that the principles of the 

10 present invention are described herein for illustration 
purposes only with reference to the PLAGI gene mapping at 
chromosome 8ql2 and the CTNNBl gene on chromosome 3p21. 
Based on the information provided in this application the 
skilled person will be able to isolate and sequence corres- 

15 ponding genes of the gene family and apply the principles of 
this invention by using the gene and its sequence without 
departing from the scope of the general concept of this 
invention. 

The present invention will thus be further elucidated 
20 by the following examples which are in no way intended to 
limit the scope thereof. 



EXAMPLES 



2 5 EXAMPLE 1 

1 . Introduction 

This example describes the isolation and analysis of 
overlapping YAC clones and the establishment of a YAC contig 
(set of overlapping clones) , which spans genomic DNA around 

30 the MOS locus and includes the translocation breakpoints, 
t(3;8) (p21;ql2) , of pleomorphic salivary glands (Fig. 8). 
Pleomorphic adenomas are benign epithelial tumors 
originating from the major and minor salivary glands. Cyto- 
genetic analysis has indicated that these tumors mainly 

35 display chromosome breakpoints in region ql2 of chromosome 
8. In previous studies we have shown that these breakpoints 
are located in the 9 cM interval between MOS / D8S285 and 
D8S260. Here we report directional chromosome walking stu- 
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dies starting from D8S260 and D8S285, resulting in, respec- 
tively, a rudimental YAC contig of 5 Mb and a highly detai- 
led YAC contig and long-range physical map of about 2 Mb. 
The latter YAC contig has at least double coverage and 
5 consists of 34 overlapping YAC clones isolated from the CEPH 
and ICRF YAC libraries. Their insert sizes were estimated by 
contour-clamped homogeneous electric field (CHEF) gel elec- 
trophoresis. Chromosomal localization of the YACs was inves- 
tigated by FISH analysis. On the basis of YAC insert and 

10 insert-end derived DNA markers and (publicly known) sequence 
tag sites (STSs) , with an average spacing of 65 kbp, as well 
as restriction enzyme analysis, a long-range physical map 
was established for this centromeric region of chromosome 
8ql2. Within the YAC contig, the relative positions of 

15 various known genes and expressed sequence tag sites was 
determined. This YAC contig constitutes the basis for the 
construction of a transcriptional map of this region. FISH 
analysis using YACs and cosmids, corresponding to STSs in 
this YAC contig, already revealed that all but two of the 

20 chromosome 8ql2 breakpoints in the primary tumors that were 
tested, mapped within a 300 kb interval between the MOS 
proto-oncogene and STS EM156. 

2. Materials and methods 
25 2.1. Pleomorphic adenomas of the salivary glands and cytoge- 
netic analysis. 

Primary pleomorphic adenomas of the salivary glands 
were obtained from patients at the time of surgery. Primary 
cultures and chromosome preparations were established and 
30 analyzed as described before (RSijer et al., 1996). The 

primary tumors with their aberrations used in this study are 
listed in Table l. Slides for FISH were prepared from cells 
stored in fixative at -20»c. 

35 2.2. Fluorescence in situ hybridization (FISH) 

Slides for FISH were aged for 2-10 days, or were trea- 
ted with 2x SSC (pH 7.0) at 37<>C for 30 min, and subse- 
quently denatured in 70% formamide in 2x SSC (pH 7.0), at 
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75**C for 2-6 minutes, Cosmid and YAC DNAs (1 ^g) were labe- 
led with biotin-16"dUTP {Boehringer Mannheim) by nick trans- 
lation. Labeled DNA was purified through a Sephadex G50 
column (Pharmacia) , coprecipitated with 50 mg sonicated 
5 human placenta DNA (Sigma) , and dissolved in hybridization 
solution (50% formamide, 2x SSC pH 7.0, 50 mM sodium phosp- 
hate pH 7.0, 10% dextran sulfate). Hybridization and probe 
detection were carried out as previously described (Roijer 
et al., 1996). The alpha-satellite probe D8Z2 was obtained 

10 from Oncor. Slides were examined in a Zeiss Axiophot 
epif luorescence microscope using the appropriate filter 
combinations. Fluorescence signals were digitalized, 
enhanced and analyzed using the ProbeMaster FISH image 
analysis system (Perceptive Scientific Instruments, Houston, 

15 Texas) • Color prints were produced using a Kodak XL 7700 
monochrome continuous printer. 

2 . 3 DNA preparations . 

Extraction of genomic DNA and the Southern analyses 

20 were performed as described previously (Schoenmakers et al,, 
1994) . DNA from YACs, cosmids, PGR products and oligonucleo- 
tides was labelled using a variety of techniques- For FISH, 
cosmid clones or inter-Alu PGR products of YAGs were bioti- 
nylated with biotin-ll-dUTP (Boehringer) by nick translati- 

25 on. YAC DNA (100 ng) was amplified by inter Alu PGR (PI: 

CTGGACTCGAGGGTGGG , P2 : TGCCAAAGTGCTGGGATTACAG) . After initi- 
al denaturation for 5 min at 94**C, 30 amplification cycles 
were performed each consisting of denaturation for 1 min at 
94 *C, annealing for 30 sec at 37**C and extension for 6 min 

30 at 72 ^'C, and with a final extension at 72 °C for 10 min. 
Amplified DNA was purified with QIAQuick PGR Purification 
kit (Qiagen) . For filter hybridizations, probes were 
radio-labelled with alpha-^^P-dGTP using random hexamers 
(Feinberg and Vogelstein, 1984) . In case of PGR-products 

35 smaller than 200 bp in size, a similar protocol was applied, 
but specific oligonucleotides were used to prime labelling 
reactions. Oligonucleotides were labelled using 
gamma-'^P-ATP . 
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2.4 YAC library screening* 

YAC clones in this paper were isolated from the CEPH 
mark 1 (Albertsen et al., 1990) and CEPH mark 3 (Chumakov et 
al., 1992) YAC library^ made available to us by the Centre 
5 d« Etude du Polymorphisme Humain (CEPH). YACs were isolated 
using a combination of PCR-based screening and colony hy- 
bridization analysis (Green and Olson, 1990) . Contaminating 
Candida parapsylosis, which was sometimes encountered, was 
eradicated by adding terbinafin to the growth medium (final 
10 concentration of 25 microgram/ml) . The isolated YAC clones 
were characterized by STS-content mapping, contour-clamped 
homogeneous electric field (CHEF) electrophoresis (Chu et 
al,, 1986), restriction mapping and hybridization and FISH 
analysis. 

15 

2.5 Cosmid library screening. 

Cosmid clones were isolated from an arrayed human 
chromosome 8-specific cosmid library (Wood et al., 1992) 
obtained from Los Alamos National Laboratory (LANL) . 
20 LANL-derived cosmid clones are indicated by their microtiter 
plate addresses. Cosmid DNA was extracted using standard 
techniques involving purification over Qiagen tips (Qiagen) • 

2.6 Pulsed-field gel electrophoresis and Southern blot 
25 analysis. 

Pulsed-field gel electrophoresis and Southern blot 
analysis were performed exactly as described by Schoenmakers 
et al. (1994). Agarose plugs containing high-molecular 
weight YAC DNA (equivalent to about 1 x 108 yeast cells) 

30 were twice equilibrated in approximately 25 ml TE buffer (pH 
8.0) for 3 0 min at 50 **C followed by two similar rounds of 
equilibration at room temperature. Plugs were subsequently 
transferred to round-bottom 2 ml eppendorf tubes and equili- 
brated two times for 30 min in 500 microliter of the approp- 

35 riate 1 x restriction-buffer at the appropriate restriction 
temperature . 

Thereafter, DNA was digested in the plugs according to 
the suppliers (Boehringer) instructions for 4 h using 3 0 



BNSDOCID: <WO 9807748A2J_> 



wo 98/07748 




PCT/EP97/04759 



15 



units Of restriction endonuclease per digestion reaction. 
After digestion, plugs along with appropriate molecular 
weight markers were loaded onto a 1% agarose / 0.25 x TBE 
gel, sealed with LMP-agarose and size fractionated on a CHEF 
5 apparatus (Biorad) for 18 h at 6.0 V/cm using a pulse angle 
of 120 degrees and constant pulse times varying from 10 sec 
(separation up to 300 kbp) to 20 sec (separation up to 500 
kbp) . In the case of large restriction fragments, additional 
runs were performed, aiming at the separation of fragments 
10 with sizes above 500 kbp. 

Electrophoresis was performed at 14 "C in 0.25 x TBE. As 
molecular weight markers, lambda ladders (Promega) and 
home-made plugs containing lambda DNA cut with restriction 
endonuclease Hindlll were used. After electrophoresis, gels 
15 were stained with ethidium bromide, photographed, and UV 
irradiated using a stratalinker (Stratagene) set at 120 mJ. 
DNA was subsequently blotted onto Hybond N+ membranes 
(Amersham) for 4-16 h using 0.4 N NaOH as transfer buffer. 
After blotting, the membranes were dried for 15 min at 80 °C, 
20 briefly neutralised in 2 x SSPE, and prehybridised for at 
least 3 h at 42 °C in 50 ml of a solution consisting of 50% 
formamide, 5 x SSPE, 5 x Denhardts, 0.1% SDS and 200 
microgram/ml heparin. Filters were subsequently hybridised 
for 16 h at 42 *C in 10 ml of a solution consisting of 50% 
25 formamide, 5 x SSPE, 1 x Denhardts, 0.1% SDS, 100 

microgram/ml heparin, 0.5% dextran sulphate and 2-3 x 106 
cpm/ml of labelled probe. Thereafter, membranes were first 
washed two times for 5 min in 2 x SSPE/ 0.1% SDS at room 
temperature, then for 30 min in 2 x SSPE/0.1% SDS at 42 °C 
30 and, finally, in 0.1 x SSPE/0.1% SDS for 20 min at 65 °C. 
Kodak XAR-5 films were exposed at -80 °C for 3-16 h, 
depending on probe performance. Intensifying screens (Kyokko 
special 500) were used. 

Agarose plugs containing high-molecular weight yeast + 
35 YAC DNA (equivalent to 1 x lo' cells ml"^) were prepared as 
described before (Schoenmakers et al., 1994). Plugs were 
thoroughly dialysed against four changes of 25 ml TlOEl (pH 
8.0) followed by two changes of 0.5 ml 1 x restriction 
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buffer before they were subjected to either pulsed-field 
restriction enzyme mapping or YAC-end rescue. 

2.7 PGR analysis. 
5 PGR amplification was carried out using a Pharmacia 

LKB-Gene ATAQ Gontroller (Pharmacia/LKB) or a Perkin Elmer 
9600 (Perkin-Elmer Cetus) in final volumes of respectively 
50 and 25 microliter containing lO mM Tris-HCl pH 8.3, 50 mM 
KCl, 1.5 mM MgClj, 0.01% gelatine, 2 mM dNTPs, 20 pmole of 

10 each amplimer, 1.25 units of AmpliTaq (Perkin-Elmer Getus) , 
and 100 ng {for superpools) or 20 ng (for pools) of DNA. 
After initial denaturation for 5 min at 94 »G, 30 amplifica- 
tion cycles were performed each consisting of denaturation 
for 1 min at 94 °G, annealing for l min at the appropriate 

15 temperature (see Table 3), and extension for l min at 72 °c, 
and with a final extension at 72 °C for 10 min. Results were 
evaluated by analysis of lo microliter of the reaction 
product on 2% agarose gels. 

20 2.8 Generation of STSs from YAG inserts and insert ends. 

YAC-end rescue was performed using a vectorette-PCR 
procedure in combination with direct solid phase DNA sequen- 
cing, as described before (Geurts et al., 1994). STSs speci- 
fic for YAG inserts were generated from inter-Alu PGR pro- 

25 ducts, isolated using published oligonucleotides TC65 or 517 
(Nelson et al., 1989) to which Sall-tails were added to 
facilitate cloning. Figure 10 shows some of these STSs. 
After sequence analysis, primer pairs were developed using 
the OLIGO computer algorithm (Rychlik, 1989) . They were 

30 tested on human genomic DNA, basically according to 
procedures described above. 

2.9 Nucleotide sequence analysis and oligonucleotides. 

Nucleotide sequences were determined according to the 
35 dideoxy chain termination method using the T7 polymerase 
sequencing kit of Pharmacia/LKB or the dsDNA Gycle Sequen- 
cing System (GIBGO/BRL) . Sequencing results were analyzed 
using an A.L.F. DNA sequencer^" (Pharmacia Biotech) on 



BNSDOCID: <WO 9807748A2_L> 



wo 98/07748 




PCT/EP97/04759 



17 



Standard 30 cm, 6% Hydrolink*, Long Range^'^ gels (AT Bio- 
chem) • Sequence analysis utilized Lasergene (DNASTAR) and 
BLAST and BEAUTY searches (NCBI; Altschul et al, , 1990). All 
oligonucleotides were purchased from Pharmacia Biotech. 

5 

3 . Results 

3.1 Isolation of YAC clones to start chromosome walking. 
According to radiation hybrid mapping data, band 8ql2 

is flanked by markers D8S165, D8S285 proximally, and by 

10 D8S260 distally (Sapru et al., 1994). The estimated genetic 
distance between these markers is 9 cM (Gyapay et al., 
1994) . Using primer sets for these microsatellite markers, 
we selected sets of corresponding YAC clones. These YACs 
were used for initiating the walk towards the 8ql2 translo- 

15 cation breakpoints, meanwhile generating a long-range contig 
based upon STS-content mapping of these. Hence we started to 
construct a 8ql2 centromeric YAC contig (from D8S165 / 
D8S285) and a 8ql2 telomeric YAC contig (from D8S260) . By 
fluorescent in situ hybridisation (FISH) experiments, we 

20 could soon show that all but two of the translocation 

breakpoints in primary adenomas mapped within a 1 cM inter- 
val within the centromeric YAC contig (Roijer et al., 1996). 
Hence, the YAC contig of the telomeric side of 8ql2 was not 
characterized in that much detail. The two YAC contigs could 

25 not be linked to each other. 

3.2 Assembly of a YAC contig starting from D8S285 

In previous studies (Roijer et al. , 1996) we have 
described the isolation of two overlapping YACs containing 

30 D8S285, namely 935E9 and 166F4 . These YACs, together with a 
MOS containing YAC (900gl0157) , were used as a starting 
point for a chromosome walking project to define the 8ql2 
centromeric region, as mentioned in the introduction. Initi- 
ally, chromosome walking was performed bidirectionally . This 

35 allowed us to hook up our 8ql2 centromeric contig to a 
chromosome 8qll YAC contig. Both contigs have the marker 
D8S593 in common. In the subsequent PCR based screening of 
the human CEPH YAC libraries with D8S108 and D8S166, we 
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isolated MegaYAC 946B7, which together with MegaYAC 935E9, 
formed the backbone of our centromeric contig. 

In the bidirectional and subsequent unidirectional 
chromosome walking steps, the following general procedures 
5 were used. First, rescuing and sequencing the ends of YAC 
clones resulted in DNA markers characterizing their left and 
right sides (Table 2) • Based on sequence data on the ends of 
the YAC insert, as well as on inter-Alu PGR products, primer 
sets were developed for specific amplification of DNA, 

10 establishing STSs (Table 3). Their location to 8ql2-qter was 
determined by CASH as well as by FISH after corresponding 
YAC and/or cosmid clones were isolated. With the sequenti- 
ally selected and evaluated primer sets, screening of the 
YAC and cosmid libraries was performed to isolate the buil- 

15 ding blocks for contig assembly • Hence, we established a 

8ql2 centromeric YAC contig consisting of 34 overlapping YAC 
clones, covering approximately 2 Mb of DNA (Fig. 1). The 34 
YACs are between 160 kb and 1660 kb long. This YAC contig 
appeared to encompass the chromosome 8ql2 breakpoints of all 

20 but two primary adenomas studied. YAC 166F4 for instance, 
was already shown to cross the translocation breakpoint in 
two adenomas with t (3;8) (p21;ql2) (Roijer et al., 1996). 
Characteristics of the YACs that were used to build this 
contig are given in Table 2. 

25 The YAC contig was constructed in parallel with the 

screening data of Whitehead Institute / MIT Center for 
Genome Research available via http: //www-genome. wi .mit, edu- 
/cgi-bin/contig/lookup_contig (Contigs WC8.7 and WC-157) . 

30 3.3 Physical mapping of polymorphic markers, ESTs and genes 
All markers and genes which were or^ became publicly 
available during our work were STS content mapped on our 
contig and those found positive were subsequently sublocali- 
zed by (primer) hybridization on YAC Southern blots. The 

35 markers that were found to reside within the centromeric 
contig presented here were D8S1069 (WI-536) , AFMB055WG9, 
D8S125, D8S108, D8S1661 {WI^5459), D8S166, D8S96, D8S1816, 
D8S285, D8S1516 (WI-3862) , D8S1828 and D8S165. Furthermore, 
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besides the HOS proto-oncogene (Testa et al., 1988), two 
other genes could be positioned in our centroineric YAC 
contig. namely the YES related proto-oncogene LYN (Corey and 
Shapiro, 1994) and the preproenkephalin gene PENK (Tomfohrde 
5 et al., 1992). Finally, our contig contains 2 expressed 
sequence tags, namely WI-7754 (D8S1930) , corresponding to 
LYN, and IB2045, not resembling any currently known gene. 
The genes encoding the kappa opioid receptor (OPRKl) , 
(Yasuda et al., 1994) and interleukin-7 (IL7) (Brunton and 
10 Lupton, 1990) are not contained in our YAC contig. 

3.4 Restriction map and putative CpG islands: Construction 
of a rare-cutter physical map from the 2 Mb YAC contig 
in the 8ql2 centromeric region 

15 Southern blots of total yeast plus YAC DNA, digested to 

completion with rare-cutter enzymes BssHII, Kpnl, Mlul, 
NotI, Pvul, Sail and Sfil (see Materials and Methods) and 
separated on CHEF gels, were hybridized sequentially with 
(i) the STS used for the initial screening of the YAC in 

20 question, (ii) pYAC4 right arm sequences, (iii) pYAC4 left 
arm sequences, and (iv) a human Alu-repeat probe (BLUR-8) . 
The long-range restriction map that was obtained in this way 
was completed by probing with PCR- isolated STSs/YAC end 
probes. Restriction maps of individual YAC clones were 

25 aligned, and a consensus restriction map was established. 
The region was searched for CpG islands on the basis of the 
colocalization of sites for these rare-cutter enzymes (Fig. 

1) . 

3 0 3.5 FISH mapping of 8ql2 breakpoints in pleomorphic 
adenomas 

In previous studies we have mapped the t(3;8) (p21;ql2) 
breakpoint within a 1 Mb region flanked by MPS proximal ly 
and by the genetic marker D8S166 distally. One YAC within 
35 this region (166F4) was shown to span the t(3;8) breakpoint 
in two primary pleomorphic adenomas (R6ijer et al- (1996)). 
We have now extended these previous observations by analysis 
of additional tumors including cases with different variant 
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translocations (Table 1) . six of eight primary adenomas with 
8qi2 abnormalities had breakpoints napping within YAC 166F4 
(Fig. 2A) . Also rumors with other 8ql2 rearrangements than 
the classical t(3;8) had breakpoints within the same YAC, 
5 demonstrating that this is the major breakpoint cluster 
region in pleomorphic adenomas with 8ql2 abnormalities. 

To fine map this region we isolated cosmids from an 
arrayed human chromosome 8 -specific cosmid library (obtained 
from Los Alamos National Laboratory; Wood et al. (1992)) 
10 using markers contained within YAC 166F4. Cosmid clones 

containing MQS, EM156 and CH129 were isolated and mapped by 
FISH (Fig. 1). The breakpoints in all six cases were 
localized between cosmids CEMl (mqS) en CEM23 (EM156) 
(Fig. 2B and C) . In for example CG 666, CEMl was transposed 
15 to 8p due to a pericentric inversion of the dicentric marker 
chromosome (Fig. 2B) , while CEM23 remained in its normal 
position at 8ql2 (Fig. 2C) . In adenoma CG 682, which has an 
ins(8;3) (ql2;p21.3pi4.1), CEM23 spanned the insertion 
breakpoint (Roijer et al. (1997)). 
20 Two of the eight adenomas tested, CG 650 and CG 787, 

had breakpoints clearly proximal to MQS. To further map 
these breakpoints we isolated two additional sets of YACs 
corresponding to CH37, D8S593 and the recently identified 
gene (Blunt et al. (1995)), respectively, in CG 650 
25 the breakpoint was found to reside between ICRF YAC 
900gl0157 and YAC 898G12 (Fig. l) , and in CG 787 the 
breakpoint was found to reside within the XRCC7 containing 
YAC 943G4 (Fig. 2D). In the latter case, which judged from 
cytogenetics had an ins (8 ;7) (qii-12 ;p21pl4) , FISH analysis 
30 revealed signals on both the der(7) and the der(8) 
chromosomes, demonstrating that this is not a simple 
8:7-insertion but a complex rearrangement with at least two 
breakpoints on 8q, one of which maps within YAC 943G4 and 
the other proximal to this YAC. 
35 3.6 Assembly of a YAC contig between D8S260 and D8S507 
Two MegaYACs corresponding to D8S260, were used to 
start the chromosome walk from the telomeric side of chromo- 
some 8ql2.The YAC insert-ends from these two clones (783H12 



BNSDOCID: <WO 9807748A2J_> 



wo 98/07748 




PCT/EP97/04759 



21 



and 814A6) were isolated and used for STS content mapping. 
The most telomeric STS {CH31) , is present in two YACs also 
containing D8S510^ and hence links the 8ql2 YAC contig to a 
8ql3 specific YAC contig (Koenig et al., 1995). When we 
5 started our physical mapping effort, only one polymorphic 
marker, D8S507, was described in between D8S285 and D8S260 
(Gyapay et al. , 1994) . Initially, we were unable to isolate 
YACs containing D8S507. In stead we used the corresponding 
amplimers to isolate a cosmid containing D8S507 (cosmid 

10 CEM3). An insert-ends of this cosmid clone (EM73) was then 
used to isolate corresponding MegaYACs (872E1, 882D9 and 
957C4) . It is unclear why these YACs were initially not 
recognised upon the YAC library screenings. This might have 
been due to either a less efficient PCR of D8S507 than of 

15 EM73, or to an underrepresentation of the D8S507 YAC-contai- 
ning yeast cells in the primary pools of the specific 
library we screened. The availability of two new polymorphic 
markers, D8S1505 and D8S1515 (WI-6879) allowed to link the 
two separate MegaYAC contigs around D8S260 and D8S507. The 

20 composite contig consists of 23 CEPH MegaYACs and covers 

approximately 5 Mb of genomic DNA (Fig. 3). The polymorphic 
markers that were tested and found to reside in the telome- 
ric YAC contig are D8S1151 {WI-1155) , D8S260, D8S1075 
(WI-943) , D8S1505, D8S1515 (WI-6879), D8S1723, D8S507 and 

25 D8S1957 (WI-9507) . 

One gene, CYP7, encoding cholesterol 7a-Hydroxylase 
(and previously assigned to 8qll-12 using both mouse-human 
somatic cell hybrids and FISH (Cohen et al., 1992)) was 
mapped in this contig using a primerset that we developed on 

3 0 publicly available sequence data (see Table 3) . 

The YAC contig was constructed in parallel with the scree- 
ning data of Whitehead Institute / MIT Center for Genome 
Research available via http://www-genome.wi.mit.edu/ 
cgi-bin/contig/lookup_contig (Contigs WC8.8 and WC-727) . 

35 

AuL. Discussion 

In this example the physical map of 2 Mb of the 
centromeric region of chromosome 8ql2 composed of 34 YAC 
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clones is presented. It is tagged by 31 markers with an 
average interval of 65 kb and was validated by FISH mapping. 
The frequency of chimaerism in the set of YACs studied in 
detail (YAC end rescue and/or restriction enzyme mapping 
5 and/or FISH) was slightly lower (6/26 = 23%) than the 30-40% 
expected (Van Ommen, 1993). These data provide the most 
detailed available physical map of the 8ql2 region. The map 
was constructed in parallel with the establishment of a 
low-resolution map of the human genome and with the 

10 screening data of the Whitehead Institute/MIT Center for 
Genome Research. It was compared with data available 
electronically via Internet, The order of the markers on the 
physical map is in agreement with the 1996 Genethon human 
genetic linkage map (Dib et al. (1996)). In total, 

15 31 markers/genes were ordered on the contig as follows: 
D8S1069-CH274-TC65.2-AFMB055WG9-CH277-CH34-DBS125— 
D8S108-EM76-D8S1661-CH122-D8S96-D8S166-CH69-CH33-D8S1816-PE- 
NK-CH280-CH129-EM156-END2-CH283-D8S285-MOS-D8S1516-LYN-D8S1- 
828-IB2045-D8S165-CH37-W1086. 

20 A restriction map with the enzymes BssHIl, Kpnl, Mlul, 

NotI, Sail, Sfil, Swal was generated, revealing at least 7 
putative CpG islands. Three of these island correspond to 
previously known genes (PENK, MOS and LYN) , while the others 
might represent the 5' end of unidentified genes so far. Our 

25 2 Mb map integrates cytogenetic, genetic and physical data 
of chromosome band 8ql2 . Chromosome band 8ql2 seems to be 
very gene-poor: so far only 4 genes were shown to reside in 
both contigs (CYP7, PENK, MOS and LYN). This is consistent 
with the view that Giemsa-positive bands are chromosome 

30 regions rather low in G+C content and poor in genes (Craig 
and Bickmore, 1993) . Chromosome 8ql2 is partially conserved 
in synteny with mouse chromosome 4 (Mock et al., 1996). 
Comparative gene mapping however did not reveal any mouse 
gene or marker not present on our physical map. 

35 Our physical map contains at least one disease locus, 

i.e. the pleomorphic adenoma translocation breakpoint. FISH 
analyses using YACs and cosmids revealed that the majority 
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of breakpoints clustered within a 300 kb region in both 
•adenomas with the recurrent t(3;8) and in those with 
different sporadic 8ql2 translocations. A cosmid from this 
region was also shown to span the breakpoint in an adenoma 
5 with an ins(8;3) (RSijer et al. (1997)). This region is 

therefore likely to harbor the pleomorphic adenoma gene. The 
breakpoint region contains one known gene (HQS) , one 
polymorphic marker (D8S285) , and three new STSs (EM156, END2 
and CH283). BLAST searches of these STSs revealed that CH283 
10 displayed sequence identity with a publicly available EST 
(expressed sequence tag) . 

Two of the eight tumors in the present series had 
breakpoints outside the 300 kb breakpoint region. By FISH we 
could demonstrate that the breakpoints in both cases were 
15 clearly proximal to MOS . In one case the breakpoint was 

mapped between MOS and YAC 898G12, and in the other case it 
was found to reside within YAC 943G4. The latter YAC maps at 
least 2 Mb centromeric to MQS. Since the two breakpoints are 
clearly different and also separate from those located in 
20 the 3 00 kb breakpoint region it is unlikely that they affect 
the same gene. However, we can of course not rule out the 
possibility that these cases have additional rearrangements 
that may have escaped detection with the DNA probes used, 
and that they in fact all have rearrangements of a common 
25 gene in the breakpoint region. 

Recently, we have obtained molecular evidence for an 
additional breakpoint region distal to our telomeric contig 
(Roijer et al. (in manuscript)). Based on FISH using YACs 
derived from 8ql3-21 we have estimated that these 
30 breakpoints are about 15 to 2 0 cM telomeric to the 8ql2 
breakpoint cluster region. Cytogenetically , these 
breakpoints are very difficult, or impossible to distinguish 
from the classical 8ql2 breakpoints. This is also true for 
the breakpoints located centromeric to MOS . Collectively, 
35 our findings thus suggest that there are several genes in 
the proximal part of 8q that are affected by chromosome 
rearrangements in pleomorphic adenomas. 
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In addition to pleomorphic adenomas there are also a 
few other types of solid tumor with structural 
rearrangements involving 8qll-i3, namely lipoblastomas (Dal 
Cin et al. (1994); Sawyer et al. (1994)), rhabdomyosarcomas 
5 (Mitelman (1994)) and renal cell carcinomas (Elfving 
(1996)). It should also be mentioned that pleomorphic 
adenomas of the lacrimal glands show recurrent 
rearrangements of 8ql2 (Hrynchak et al. (1994)), including a 
t(3;8) (p21;ql2) . Because of the well known histopathological 

10 and clinical similarities between pleomorphic adenomas 
originating from the salivary and lacrimal glands it is 
likely that the 8ql2 abnormalities in these tumor types 
affect the same gene. As for the other tumor types mentioned 
it remains to be shown whether the rearrangements in these 

15 cases also affect this gene. Preliminary FISH analysis of 
lipoblastoma with an 8ql2 translocation breakpoint have 
shown that the breakpoint, as in pleomorphic adenomas, is 
distal to MQS (unpublished observations) . Unfortunately no 
markers telomeric to MQS could be tested. FISH studies are 

20 now in progress to resolve these critical questions. 

Finally, a second YAC contig was established between 
D8S260 and D8S507, consisting of 23 CEPH MegaYACs and cove- 
ring approximately 5 Mb. Both contigs represent about 75% of 
human chromosomal band 8ql2, which extends over approximate- 

25 ly 9 Mb of DNA. They will provide molecular access to new, 
previously unidentified genes from this region and especial- 
ly to those directly affected by the chromosome 8ql2 aberra- 
tions in pleomorphic adenomas of the salivary gland. 

30 Figure legends 

Figure 1. 

Physical contig and STS marker map in proximal human 
8ql2. The upper line indicates the chromosome, with the 
35 telomere to the left and the centromere to the right. DNA 
markers and genes are ordered across the top of the figure. 
Genes are underlined and previously known markers are indi- 
cated in bold. YAC clones are represented as lines together 
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with their names and are described in Table 2. STS locations 
are indicated by dashed lines. The length of the solid line 
indicates its approximative size. The size of YAC 391H2 and 
392A6 is too large to fit within the constraints of the map, 
5 suggesting chimaerism« In these cases, the lenght of the YAC 
corresponds only to the markers it contains. YAC end clones 
used as STS are represented by arrows. Below the contig the 
consensus restriction maps for BssHII (B) , Kpnl (K) , Mlul 
(M) , NotI (N) , Pvul (P) , Sail (S) and Sfil (Sf) are indica- 
10 ted. Verical arrows indicate putative CpG islands, defined 
as the colocalization of sites for (a) K, M, N; (b) K, M, P, 
S, Sf; (c) K, M, P, Sf; (d) K, N, S; (e) K, N, S; (f) B, N, 
s; (g) B, K, N, Sf. 

The pleomorphic adenoma gene region is indicated by a gray 
15 shaded box. 

Figure 2 . 

Physical contig and STS marker map in distal human 
8ql2. The upper line indicates the chromosome, with the 

20 telomere to the left and the centromere to the right. DNA 
markers and genes are ordered across the top of the figure. 
Genes are underlined and previously known markers are indi- 
cated in bold. YAC clones and cosmid CEM3 are represented as 
lines together with their names. YAC end clones used as STS 

25 are represented by arrows. 

Figure 10. 

STSs used to generate the 3 00 kb cosmid contig mapping 
at chromosome 8ql2 and encompassing PLAGl . 

30 
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TABLE 


2: Analysis 


of YAC 


clones in the centromeric contig 




CEPH code 


Size (kti\ 


Landm. left Acc. No. 


Landm. right Acc. No. 


Chimeric 


518G7 




CH131 






Y (left) 


518G12 




CH132 






ND 


770B2 


1410 


CH281 




CH286 


NO 


878F2 


800 


CH282 




CH287 


ND 


255H10 


390 


CH273 




CH272 


ND 


53ID4 


430 


CH274 




CH277 


ND 


15H11 


430 








ND 


29ipi2 


410 


CH29 




CH34 


Y (left) 


47E9 


370 








ND 


26F6 


460 








ND 


916F12 


1230 


CH122 




CH128 


ND 


388D6 


430 


CH69 




CHIOO 


ND 


297E11 


350 


CH280 




CH285 


ND 


253H7 


160 


CH279 




CH284 


ND 


143D5 


260 


CH278 




CH283 


ND 


391H2 


620 








Y 


392A6 


600 








Y 


84E12 


500 


CH289 




CH290 


ND 


166F4 


700 


CH67 




CH73 


ND 


164H5 


375 


END2 




END6 


ND 


206F11 




ENDS 




END7 


Y (right) 


253 C7 


620 


END4 




ENDS 


ND 


295A4 




CH68 




CH74 


Y (left) 


917E11 


1380 


ENDl 




ENDS 


ND 


946B7 


1370 


CH123 




CHI 29 


ND 


935E9 


1660 


CH33 




CH37 


ND 



Note. YAC clones were isolated from CEPH YAC libraries as described under Materials and 
Methods. ND. not delected by methods used (FISH, YAC end rescue, restriction enzyme 
mapping). GenBank accession numbers are given. 
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TABLE 3: 8ql2 STS primer sequences, annealing temperatures and 
expected PCR product sizes 

Amplimers on published sequences centromeric contis 

STS name Nucleotide sequence S' - 3' Product Ta (°C) 

size (bp) 



LYN 



GGAAGGAAAGGAAAGGAGA 
GGTTTGGGTGTTTGGTGTG 

GAGCTCAACGTAGCAAGGCT 
AACTGTCCrCCAGTGCGG 



194 



203 



60 



60 



PENK 



TAATAAAGGAGCCAGCTATG 



75-83 



58 



D8S96 



TCTCTACCTCGACATACrCCTGC 
GTCAGAAGTGCCAATCAGACTG 



113 



60 



D8S106 



CAAACCTTGAATTACAAAACAG 
TGTTAATATTAGACCACCTTTC 



1)6 



58 



D8S125 



TTCTGTTTCCTGGTCTCAGTAGC 
CACATGCATATGTGTATGTGTGTG 



144 



62 



D8S165 



ACAAGAGCACATTTAGTCAG 
AGCTTCATTTTTCCCTCTAG 



138-152 



58 



D8S166 



GATTGTGTCATTGCACTCCA 
ACAAGGAAGTTCCmTTGG 



116 



58 



D8S285 



GCATCACACAGAATCTTTG 
ATGGGTTTATGGCCTTTAC 



108-124 



60 



D8S1069 



AGCACAGTGGATATTTTTAGGC 
GGGGCTCACACAGAAGTTAA 



221 



60 
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D8S1516 



D8S1661 



D8S1816 



D8S1828 



GTCCCCCATCAACATGCTG 174 60 

CTCATCrTOmTCATAGTOTTCC 

TCrCATGCATGTTTCCTGTTG 126 60 

GTTGGGGTCATTAAACACTAGTCA 

TGCACCCTTAAAAAGCATCG 143-147 60 
ACTTGCGAACATGGGATCAC 

AGTGCTGTTTTTACrTCTGTACG 198-238 60 
GCAAGACTCTGTCTCAGGA 



a|F=MB055WG9 CTCCCAACCCACCCGAC 

TGAAAACCATAATCrCTGATGTTGC 



218 



60 



Amplimers on newlv isolated STSs centromeric contie 
STS name Nucleotide sequence 5' - 3' 



Product Ta ('C) 
size (bp) 



CH33 



CH34 



CH37 



CCTTTGGCTGGGGTTTATA 
GGCCTATGAAGCAAGAGAG 

GTACCCAGAAGGCAAGTAA 
GTGAAAAGGCAGAAATTAG 

TTGGATGAGAATGGAAATG 
GGCGTTACrTTCCTTTTGT 



168 



145 



176 



60 



60 



58 



CH69 



CH122 



AGTGCTTACAATAGGGTGAG 
CCATCCAGAAAGACCATAAT 

TTTGTCTTTGATTTTTATGG 
TGACCAACATACTGCCTAGT 



336 



250 



60 



57 



CH129 



CTGAATCCCAGAACAATATA 
AGGGTAAGTATGTCCnTAA 



110 



60 
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ATAATG7TGAGACTTTGAGA 
AAATGTTTATCCTAATTGTA 



157 
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58 



CH274 



CAGGTGAGTGGATGGTGTAA 
CAAGGGGAGACCAAATCATC 



241 



60 



CH277 



AATGGCTATGAGGTTGTTTT 
CACATCCTTTCATTTTAGCA 



122 



58 



CH280 



GGGCTGATGTTCCATTAACT 
GCTTCA ACA CCA A A AATGCr 



163 



58 



EM76 



CTGGGAAGAGATCAAAATTC 
TAAAGAGACAGCACCACAAA 



220 



60 



AU 1 AUCAGCAGCAACAGTCA 
TGCGCTATTCAGAGAAGATG 



160 



60 



EM216 



CAGTCAGTTCCAGAGGTCATTT 
TAGGGAGGGCTTTAATAGTGTT 



255 



55? 



ENDl 



GCTCACTTCACTCCTACCC 
CAA CCA ACCACTAA AA ACQ 



161 



60 



END2 



GTGATTTTACAGCCATTTT 
TGTAATTTTCAACCAGAAG 



91 



50 



TC65.2 



TACAAACCGGGAGAAAACAG 
TTACAGCATTTCCGATTTTG 



232 



58 
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Amplimers on published sequences centromeric contig 

STS name Nucleotide sequence 5' - 3' Product Ta i'C) 

size (bp) 



CYP7 



TTGACrill AAATATGATAGGTAT 1600 
ACmTATTTCTGAAAGATGAATCA 



55 



D8S260 



AGGCTTGCCAGATAAGGTTG 
GCTGA A GGCTGTTCTATGG A 



187-213 



60 



D8S507 



TTCCTCAGAGCAGTTCAAAG 
TAATCTTGCCCCAGTGAGAG 



155-169 



60 



D8Sni3 



ATGCAAAGATGAACCAGGAA 
CCCTGGACTCATGGTACrTG 



216 



62 



D8S1505 



GGATTTTAAGTTTCTACAAAGGGA 328 
ACAATTTCCATGAAGTGTTCACC 



58 



D8S1723 



AGGTCAATGGCACGTCCTTT 
ACTGCTGACTCAGAGCCTGG 



235-243 



60 



Amplimers on newly isolated STSs telonneric contig 



STS name 



Nucleotide sequence 5' - 3' 



Product Ta CC) 
size (bp) 



CH31 TCACAGAATAATACAGGAT 220 58 

GATCACTGATGATACTAGG 

CH32 ATTTGCCTCAGTGTTGCAG 245 62 

ATTTTCAGGAGGTCAGGGA 

CH35 CAAAATGACTTATGCrGAA 165 60 

TCTATACAGGGCATTGTGA 
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EM73 AAAGCAAGACCCTGTAAAGC 351 60 

CTTGGGCrcrATTTTGTGAA 

EXAMPLE 2 

5 1. Introduction 

In this example the identification of a member of the 
FLAG gene family that appears to be of pathogenetical rele- 
vance is described* In previous experiments, it was found 
that the segment between HQS and PENK on human chromosome 
10 8ql2 harbors the recurrent chromosome t(3;8) (p21;ql2) 

breakpoint frequently found in pleomorphic adenomas of the 
salivary glands. Using a positional cloning approach, the 
PIAGl gene was identified and its genomic organization 
character i zed • 

15 The largest cytogenetic subgroup of pleomorphic adenoma 

of the salivary glands carries chromosome 8ql2 aberrations 
with 3p21 as preferential translocation partner. Here we 
demonstrate that the t (3 ;8) (p21;ql2) results in promoter 
swapping between PLAGl , a novel, developmentally regulated 

20 zinc finger gene on 8ql2, and the constitutively expressed 
gene for B-catenin ( CTNNBl ) , a protein interface functioning 
in adherens junctions and the WG/WNT signalling pathway. 
Fusions occur in the 5 ' -non-coding parts of both genes, 
exchanging regulatory control elements while preserving the 

25 coding regions. Due to the t ( 3 ; 8) {p2 1 ;ql2) , PLAGl is activa- 
ted and expression of CTNNBl down-regulated. Activation of 
PIAGl was also observed in an adenoma with a variant trans- 
location t(8;15). The results indicate that PLAGl activation 
due to promoter swapping is a crucial event in salivary 

30 gland tumor igenesis . 

Recently, molecular insight was obtained into the 
genetic basis of benign tumors by the discovery of a common 
genetic denominator in tumors of different types but with a 
particular cytogenetic profile (Schoenmakers et al. (1995); 

35 Ashar et al. (1995)). Molecular analysis of recurrent 

chromosome 12ql3-15 aberrations in pleomorphic adenoma of 
the salivary glands, lipoma, uterine leiomyoma, hamartoma of 
lung and breast, fibroadenoma of the breast, angiomyxoma. 
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and endometrial polyps revealed the consistent involvement 
of the high mobility group protein gene HMGIC ; the first 
"benign oncogene". 

Evidence is mounting that the same is likely to hold 
5 true for the closely related family member HMGIY ^ which is 
located on the short arm of chromosome 6. Both genes belong 
to the high mobility group (HMG) protein gene family (Bustin 
et al. (1990). Their corresponding proteins possess three 
amino- terminal AT hooks, through which the proteins are 

10 assumed to bind to A/T-rich DNA sequences, and an acidic 
tail in the carboxy-terminal region. Functionally, they act 
as architectural factors in the nuclear scaffold, critical 
for the correct assembly of stereospecif ic transcriptional 
complexes (Wolffe (1994) ) . 

15 The genetic aberrations frequently result in the 

separation of the three AT hooks from the acidic tail, 
although all the pivotal changes in the high mobility group 
protein genes that can lead to aberrant growth control are 
not yet known. It should be noted that a variety of chromo- 

20 some segments can act as translocation partner of 12ql3-15 
but each tumor type seems to have a preferential transloca- 
tion partner. At present, only the preferential t(3;12)(q27- 
28;ql5) translocation of ordinary lipoma has been 
characterized (Petit et al. (1996)). The gene affected on 

25 chromosome 3 is the ^jPP gene, which encodes a protein 

containing three LIM domains. LIM domains have been found in 
a growing number of proteins in mammals, amphibians, flies, 
worms, and plants and act as modular protein-binding 
interfaces (Schmeichel & Beckerle (1994)). LIM proteins 

30 mainly have a function in cell signalling and developmental 
regulation and include, for instance, transcription 
regulators^ proto-oncogene products, and adhesion plaque 
constituents. The preferential t(3;12) in lipoma results in 
the formation of an HMGIC/LPP fusion transcript encoding a 

35 hybrid protein consisting of the three DNA binding domains 
of HMGI-C and the LIM domains of the protein encoded by LPP , 

In an approach to systematically unravel the molecular 
pathogenesis of benign tumors, the inventors have focused on 
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other cytogenetic subgroups of these, since their mere 
existence indicate the presence of additional critical 
genes. It is of interest to establish whether such genes are 
functionally similar or different as compared to the high 
5 mobility group protein genes HMGIC and HMGIY > In this con- 
text, the inventors have now molecular ly characterized the 
largest cytogenetic subgroup of pleomorphic adenomas. This 
type of neoplasm is an epithelial tumor occurring primarily 
in the major and minor salivary glands. It is by far the 

10 most common type of salivary gland tumor, accounting for 
almost 50 % of all tumors in these organs (Seifert et al. 
(1990)). Pleomorphic adenomas are almost exclusively benign 
tumors, which only rarely undergo malignant transformation. 
They show a marked histological diversity with epithelial 

15 and myoepithelial cells arranged in a variety of patterns in 
a matrix of mucoid, myxoid, chondroid and, on rare 
occasions, even osteoid tissues* Cytogenetically, 
pleomorphic adenomas are characterized by recurrent 
rearrangements, in particular reciprocal translocations with 

20 consistent breakpoints at 3p21, 8ql2 and 12ql3-15 (Mitelman 
(1994)). In addition to the cases with abnormal karyotypes, 
there is also a subgroup of tumors with apparently normal 
karyotypes (30 to 50% of the cases) . Abnormalities of 8ql2 
are most common and are found in about 60% of the cases with 

25 abnormal stemlines, whereas rearrangements of 3p21 and 
12ql3-15 are found in about 30% and 2 0% of the cases, 
respectively. The most frequent and characteristic 
abnormality so far observed in pleomorphic adenomas is a 
reciprocal t (3 ;8) (p21;ql2) translocation (Mark et al. 

30 (1980)). While chromosome 3p is the preferential 

translocation partner, most human chromosomes have been 
found as translocation partner of 8ql2. Rearrangements of 
8ql2 are often found as the sole anomalies, indicating that 
they represent primary cytogenetic events of possible 

3 5 pathogenetic importance . 

Here, the inventors describe the positional cloning of 
the 8ql2 translocation breakpoint as well as the identifica- 
tion and characterization of the genes on chromosome 8ql2 
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and 3p21 disrupted by the t(3;8) translocation • The gene on 
chromosome 8ql2 is a novel zinc finger gene, which we have 
designated PLAGl, whereas the gene on 3p21 is CTNNBl which 
is the translocation partner, the gene for B-catenin, a 
5 protein with an established role in cell adhesion and signal 
transduction (Peifer (1993)). By Southern blot and RACE 
analysis, we pinpoint the breakpoints within the introns of 
the 5*-noncoding regions of both genes and show that well- 
defined fusion transcripts are expressed in the tumor cells. 

10 We also demonstrate that the t(3;8) translocation results in 
specific disruption of the transcriptional control of the 
two genes, leading to activation of PLAGl and down- 
regulation of CTNNBl . A pleomorphic adenoma with 8ql2 but 
not 3p21 involvement has also been studied to evaluate the 

15 phenomenon of disruption of transcriptional control in a 
broader perspective. 

2 . Materials and methods 

2.1 Tumor material and chromosome analysis 

20 Primary pleomorphic adenomas of the salivary glands 

were obtained from patients at the time of surgery. Primary 
cultures and chromosome preparations were made and analyzed 
as described by Roijer et al. (1996) . Primary pleomorphic 
adenomas, including CG368, CG580, CG644, CG752, CG753 and 

25 T9587, which all carry the recurrent t (3 ;8) (p21;gl2) as the 
sole anomaly, were selected for molecular analysis, as well 
as CG682, showing an ins(8 ;3) (gl2 ;p21. 3pl4 . 1) , and CG588 
which carries a t (8;15) {ql2;gl4) . CG tumors are obtained 
from Department of Pathology, Goteborg University, 

30 Sahlgrenska University Hospital, S-41345 Goteborg, Sweden. 
The tumor identified as T9587 is obtained from University 
Hospital Ghent, Belgium. FISH analysis was performed as 
previously described {Roijer et al. (1996)). Slides were 
examined in a Zeiss Axiophot epif luorescence microscope 

35 using the appropriate filter combinations. Fluorescence 
signals were digitalized, enhanced and analyzed using the 
ProbeMaster FISH image analysis system (Perceptive 
Scientific Instruments, Houston, Texas) . Color prints were 
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produced using a Kodak XL 7700 monochrome continuous 
printer . 

2.2 Preparation and analysis of DNA and RNA 

5 Extraction of genomic DNA and Southern blot analysis 

were performed as described previously ( Schoenmakers et al. 
(1994)). Total RNA was extracted from biopsy samples using 
the TRIZOL (6ibco/BRL) method and Northern analysis was 
performed according to standard procedures (Sambrook et al. 

10 (1989)). DNA from YACs, cosmids, PGR products, and 

oligonucleotides was labelled using a variety of techniques. 
For FISH, cosmid clones or inter-Alu PGR products of YAGs 
were biotinylated with biotin-li-dUTP (Boehringer) by nick 
translation. For filter hybridizations, probes were radio- 

15 labelled with ek-^^P-dCTP using random hexamers (Feinberg & 
Vogelstein (1984)). In case of PCR-products of T-genes of 
the invention smaller than 200 bp in size, a similar 
protocol was applied, but T-gene specific oligonucleotides 
were used to prime labelling reactions. Oligonucleotides 

20 were labelled using y-'^P-ATP. 

2.3 YAC, cosmid, phage, and cDNA libraries 

YAC clones in this paper were isolated from the CEPH 
mark l YAG library (Albertsen et al. (1990)), using a 

25 combination of PCR-based screening and colony hybridization 
analysis (Green & Olson (1990)). YAC DNA was isolated and 
characterized as described before (Schoenmakers et 
al.(1995); Schoenmakers et al. (1994)). Cosmid clones were 
isolated from an arrayed human chromosome 8-specific cosmid 

30 library (Wood et al. (1992)) obtained from Los Alamos 

National Laboratory (LANL) . LANL-derived cosmid clones are 
indicated by their unique microtiter plate addresses. Phage 
clones were derived from a genomic library constructed with 
Li-l4/sV40 DNA (Schoenmakers et al. (1994)) in XFIXII 

35 according to standard procedures. Cosmid and phage DNA was 
extracted using standard techniques involving purification 
over Qiagen tips (Qiagen) . Positive cDNA clones were 
identified in a mixed poly-dT/ random-primed library in >.gtll 
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constructed from human fetal kidney (Clontech) • Library 
screening was performed by plague hybridization using 
various DNA probes, derived from subsequently isolated cDNA 
clones, according to the manufacturer's instructions, 

5 

2.4 DNA sequencing and computer analysis 

Nucleotide sequences were determined according to the 
dideoxy chain termination method using the T7 polymerase 
sequencing kit of Pharmacia/LKB or the dsDNA Cycle Sequen- 
10 cing System (GIBCO/BRL) • Sequencing results were analyzed 
using an A.L.F. DNA sequencer^** (Pharmacia Biotech) on 
standard 30 cm, 6% Hydrolink*, Long Range^** gels (AT Bio- 
chem) . Sequence analysis utilized Lasergene (DNASTAR) and 
BLAST and BEAUTY searches (NCBI) . 

15 

2.5 PGR amplification of genomic DNA 

PGR amplifications were carried out essentially as 
described before (Schoenmakers (1994)). The following 
amplimers were used to generate a PLAGl exon 1 probe (5'-GAA 

20 TGG CTG CTG GAA AGA GG-3 » and 5»-CCC GTC CGC CGC CTC TAG 
AGC-3M/ a PLAGl ORF probe (5'-CGT AAG GGT GGT GAA ACC AAA 
C-3« and AGG GTC GTG TGT ATG GAG GTG A-3 • ) , a PLAGl 3 • -UTR 
probe (5'-ACA TGG CAT TTC GTG TCA CT-3 • and 5»-CCA CAA TGG 
CTC TAG AT-3') and a CTNNBl exon 1 probe (5 '-TGT GGC AGC AGC 

25 GTT GGC CCG GC-3 • and 5 '-CTC AGG GGA ACA GGC TCC TC-3 • ) . 

2.6 Rapid amplification of cDNA ends (RACE) 

Rapid amplification of 3* cDNA-ends (3 '-RACE) was 
performed using a slight modification of part of the 

30 GIBCO/BRL 3»-ET protocol. For first strand cDNA synthesis^ 
adapter primer (AP2) AAG GAT CCG TCG ACA TC(T)17 was used. 
For both initial and secondary rounds of PGR, the universal 
amplification primer (UAP2) CUA CUA CUA CUA AAG GAT CCG TCG 
ACA TC was used as "reversed primer". In the first PGR round 

35 the following specific "forward primers" were used: i) 5'- 
CAA TGG CTG CTG GAA AGA GG-3 • (exon 1) or ii) 5 •-AGA ATT TGG 
GCC TCA GAG AAG ATA-3 • (3«-UTR, exon 5). In the second PGR 
round the following specific forward primers (nested primers 
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as compared to those used in the first round) were used: i) 
5"-CAU CAU CAU CAU GGC CGG AGG GAG GAT GTT AA-3 • (exon 1) or 
ii) 5* -CAU CAU CAU CAU ATT GTC CTG GGT TGA TTA TGC AT-3 • 
(3»-UTR, exon 5) • CUA/CAU-tailing of the nested, specific 
5 primers allowed the use of the directional CloneAmp cloning 
system (GIBCO/BRL) . 

For 5 •-RACE experiments, the Marathon cDNA Amplificati- 
on kit (Clontech) was used according the manufacturer's 
instructions with minor modifications. The 5 • -untranslated 

10 end of the normal PLAGl transcript as well as the chimeric 
transcripts were isolated by 5 •-RACE. First strand placenta 
or adenoma cDNA respectively, was synthesized from 5 /ug 
total RNA using the MV2 primer (5 '-CTG CAC TTG ACC CAC CCC 
TTG GAT-3') located in exon 5. The ds cDNA was ligated to 

15 the adaptor and amplified using the anchor primer API and 
the MVS primer (5*-CAG GAG AAT GAG TAG CCA TGT GC-3 • ) also 
located in exon 5. A second round of PCR was performed using 
the anchor primer and the MV6 primer (5»-TGC ACT TGT AGG GCC 
TCT CTC CTG-3«) located in exon 4. The final PCR products 

2 0 were purified out of agarose gel and cloned into the pCRIl 
vector (Invitrogen) • 

2 . 7 RT-PCR 

Total RNA (5 fjig) was reverse-transcribed using Super- 
25 script II reverse transcriptase (GIBCO BRL) and oligo d(T) 
primers according to the recommended conditions. 0.25 tig of 
the resulting cDNA was subject to amplification using a 
variety of primer sets. The amplification conditions for the 
CTNNBl /PLAGl fusion transcripts were 3 0 cycles at 94 *^C for 
30 10 sec and 68 ^C for 1 min in a final volume of 50 /Ltl using 
the Expand long template PCR system (Boehringer Mannheim) . 
The first round PCR was carried out with the CTNNBl primer 
5 '-TGT GGC AGC AGC GTT GGC CCG-3 • (CAT-UP) and the PLAGl 
primer 5»-CAG GAG AAT GAG TAG CCA TGT GC-3 ♦ (MV5) . The 
35 second round was performed on a 20 fold diluted sample with 
the CTNNBl primer 5«-ACG GAG GAA GGT CTG AGG AGC AG-3 • 
(NECAT-UP) and the PLAGl primer 5 '-TGC ACT TGT AGG GCC TCT 
CTC CTG-3' (MV6) . To amplify the reciprocal PLAGl /CTNNBl 
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fusion transcript two rounds of PGR amplification were 
performed with 30 cycles at 94 for 30 sec, 63 ""C for 30 
sec and 72 ^C for 1 min in a final volume of 50 Ml* The 
first round was carried out with the PIAGl primer 5'-CAA TGG 
5 CTG CTG GAA AGA GG-3 ' (START-UP) and the CTNNBl primer 5'- 
AAG GAG CTG TGG TAG TGG CAC-3 • (CAT3), The second round was 
performed on a 20 fold diluted sample with the PIAGl primer 
5'-GGC CGG AGG GAG GAT GTT AA-3 « (START-RACE) and the CTNNBl 
primer 5 » -GCC GCT TTT CTG TCT GGT TCC A-3 * (CAT3NEST) . 

10 

1^ Results 

3.1 The chromosome 8ql2 breakpoint cluster region in pleo- 
morphic adenoma 

As part of a positional cloning effort to isolate the 

15 gene(s} affected by the t (3 ;8) (p21;ql2) in pleomorphic 

adenomas, we constructed a yeast artificial chromosome (YAC) 
contig consisting of 34 overlapping YACs, spanning about 2 
Mb within band 8ql2 (to be published elsewhere) « A long 
range STS and rare cutter physical map of this region was 

20 also developed* In previous experiments the t(3;8) break- 
point was mapped to a 1 Mb region flanked by MPS as proximal 
and by D8S166 as distal marker. One YAC within this region 
was shown to span the t(3;8) breakpoint in two tumors (CG588 
and CG644) • To further narrow down the breakpoint region we 

25 isolated cosmids corresponding to landmarks within this and 
other YACs in our contig (Fig* 3A) . By FISH, we could 
demonstrate that the breakpoints in four out of four adeno- 
mas tested (CG580, CG588, CG644 and CG682) clustered within 
a 300 kb subregion located between MPS and STS CH129 (Fig. 

30 3A) . Two cosmids from this region, CEM23 and CEM48, were 
shown to span the breakpoints in adenomas CG644 (Fig. 3B) 
and CG682 (data not shown) , respectively. To refine the 
distribution of breakpoints within this 300 kb subregion, we 
developed a contig consisting of 29 phage and cosmid clones 

35 (Fig. 3A) • 

3.2 Identification and characterisation of the PLAGl gene 
in the 8ql2 breakpoint region 
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While initiating experiments to identify candidate 
genes mapping within the 300 kb genomic segment harbouring 
the adenoma breakpoints, BLAST searches (Altschul et al. 
(1990)) of newly obtained STSs mapping within the 300 kb 
5 contig revealed that the right end (CH283) of YAC clone 

143D5 displayed sequence identity with a publicly available 
EST (expressed sequence tag) (GenBank Accession number 
D59273), Its position in the contig raised the possibility 
that this EST belonged to the candidate pleomorphic adenoma 

10 gene (EIAfil) , we were searching for. 

In initial Northern blot experiments using a PGR probe 
corresponding to this particular YAC end, a 7.5 kb 
transcript was detected* A combination of sequential scree- 
nings of a human fetal kidney cDNA library as well as 5»- 

15 and 3 » -RACE experiments led to the isolation of a composite 
cDNA of 7313 nucleotides (GenBank accession number U65002), 
This cDNA contains an open reading frame (ORF) of 1500 bp 
starting with the ATG at position 481-483 (Fig. 4A) . An in 
frame stop codon (TAG) is present 9 nucleotides upstream of 

20 this ATG, The deduced amino acid sequence reveals seven 
canonical C2H2 zinc finger domains (Fig. 4B) and a non- 
finger region of 259 amino acid residues representing the 
carboxy-terminus of the deduced protein. The zinc finger 
motifs including the linker sequences are between 2 8 and 3 5 

2 5 amino acids long. The cysteine (C) and histidine (H) residu- 
es are present in their characteristic positions in each 
finger. The typical phenylalanine (F) and leucine (L) resi- 
dues in finger 4 and 6 are lacking. Strictly spoken, the 
deduced protein is not a Kruppel zinc finger protein, since 

30 it does not contain the characteristic H/c linker (consensus 
sequence TGEKPYK) in between the zinc fingers (Bellefroid et 
al. (1989)). Only the seven amino acids between finger l and 
2 resemble the H/C linker (TGERPYK) . The amino-terminal 
region contains two nuclear localization signals (KRKR and 

35 KPRK) . The carboxy-terminus is serine-rich (45 amino acid 
residues out of 259, i.e 17%), raising the possibility of a 
regulatory role that may be controlled by serine/threonine 
kinases. 
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outside the ORF, some interesting features in the cDNA 
can be observed. In the 3' untranslated region (UTR) , which 
is 5333 bp long, a polyadenylation signal is present star- 
ting at position 7297, and there is also a TG-repeat, which 
5 might be of regulatory relevance. In addition, there is an 
ATTTA sequence motif, which has previously been shown to 
mediate roRNA destabilization in certain lymphokine and 
immediate early genes (Sachs (1993)). The ATTTA motif is 
repeated twelve times in the 3*-UTR. 

10 No sequence similarity was found to any of the known 

zinc finger proteins but preliminary studies have indicated 
that the human genome contains at least one additional gene 
that is closely related to our candidate pleomorphic adenoma 
gene. Nine anonymous ESTs (expressed sequence tags) with 

15 sequence similarity to sequences of our new gene were found 
in public databases and these ESTs could be aligned resul- 
ting in a contiguous sequence of about 1 kb. An open reading 
frame appeared to be present with coding potential for 
protein sequences with zinc-finger-like features. Sequence 

20 identity between the aligned ESTs and our new gene appeared 
to be about 75% at the amino acid sequence level; i.e. in 
the region of our gene encoding zinc fingers 4 to 7 • Of 
interest to note furthermore is that these anonymous ESTs 
sequences have been mapped to chromosome 6q24, which is 

25 implicated in tumors of the salivary glands. 

3.3 Genomic organization of the PLAGl gene 

Comparison of transcribed and genomic DNA sequences of 
the PIAGl gene revealed that it contains 5 exons (Fig. 3A) . 

30 The transcriptional orientation of the gene is directed 

towards the centromere. The first three exons are noncoding, 
the fourth exon contains the translation start site (ATG) , 
and the amino-terminus of the protein including one complete 
zinc finger domain. The second finger is split by intron 4 

35 and continues into exon 5, which contains the remaining part 
of the ORF and the long 3'-UTR (5533 bp) . Based on our VAC 
and cosmid maps, the PLAGl locus spans about 35 kb, with a 
large intron (approximately 2 5 kb) between exon 1 and exon 
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2. 3 '-and 5 '-RACE analysis revealed the existence of two 
alternatively spliced mRNAs which differ from each other by 
the presence or absence of the noncoding exon 2. The diffe- 
rence in electrophoretic mobility of the two mRNA isoforms 
5 is too small to distinguish them by Northern blot analysis. 
The functional relevance of this alternative splicing re- 
mains to be elucidated. 



10 



3.4 Rearrangements of £LAGl in primary pleomorphic adenoma 

In an attempt to establish whether the 8qi2 aberrations 
in primary pleomorphic adenomas with t (3 ;8) (p21;qi2) led to 
rearrangements in EL&Gl, Southern blot experiments were 
performed using various PlJ^-derived probes, with a probe 
within the ORF (STS probe EM265) or the 3«-UTR (KK64) of 
15 SLhSX, no rearrangements were detected. However, using 
probes corresponding to the 5* noncoding region of plagi . 
i.e. intron l (STS probes EM416 or EM317) or exon 2 (STS 
probe EM440), rearrangements were detected in each of the 
t(3;8) tumors tested (data not shown). Similar Southern blot 
20 analysis of primary pleomorphic adenoma CG580, which carries 
a t(8;i5) (ql2;ql4) , revealed also a rearrangement in the 5* 
noncoding region of PLAGi (Fig. 5) . These results seem to 
directly implicate fLAfil as a critical gene in pleomorphic 
adenoma tumor igenes is. Furthermore, the mechanism might 
25 involve transcriptional deregulation of plagi . since the 
rearrangements in PIAGl were invariably found in the 5« 
noncoding region of PjAgl leaving its coding region intact. 

3.5 t(3;8) results in promoter swapping between plagi and 

30 the gene for B-catenin, CTNNBi 

The Observation that the 8qi2 translocation breakpoints 
in various primary tumors were mapped to intron 1 or intron 
2 of the 5 '-noncoding region of PLAGI raised the possibility 
that the t(3;8) results in the production of a chimeric 

35 transcript consisting of sequences fused to those of a 

gene located on chromosome 3p2l. To test this possibility, 
5 '-RACE experiments were performed using total RNA of the' 
primary tumors CG644 and CG682, which both carry a 3; 8- 
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rearrangement. We used a primer specific for exon 5 (MV2) in 
the cDNA synthesis. Following ligation of an adaptor to the 
cDNA, PGR amplification was performed using, in the first 
round, an adaptor-specif ic primer and a EIASi-specif ic 
5 primer corresponding to sequences of exon 5 (MVS) and, in a 
second round, nested adaptor-specific primer and PLAGl - 
specific primer corresponding to sequences of exon 4 (MV6) . 
Nucleotide sequence analysis of the PGR product revealed 
that the ectopic sequences were fused to the acceptor splice 

10 site of exon 3 of PLAGl . BLAST analysis revealed that they 
were identical to exon 1 sequences of CTNNBl (Nollet et al. 
(1996)), the gene for 6-catenin, which has previously been 
assigned to chromosome 3p21 (Kraus et al. (1994)). 

Independent evidence for the involvement of CTNNBl in 

15 the t(3; 8) -rearrangements in pleomorphic adenomas was obtai- 
ned by FISH using two YAGs (756G5 and 750D3) , which contain 
the complete gene for 6-catenin. According to published maps 
YAG 756G5 is contained within the proximal part of mega-YAC 
750D3 (Gemmill et al. (1995)). As expected, both YAGs 

20 spanned the 3p21 breakpoint in adenoma GG682, with 

hybridization signals on the normal 3, the der(3) and the 
der(8) (data not shown) . Detailed analysis of a few 
prometaphase cells enabled us to sublocalize the B-catenin 
locus to band 3p21.3, which is in accordance with previous 

25 mapping data. 

To extend our observation to a larger number of tumors, 
we applied a reverse transcription-polymerase chain reaction 
(RT-PGR) approach. RT-PGR amplifications using primers 
specific for GTNNBl and PLAGl were carried out with RNA from 

30 tumors C6368, CG588, CG644 , GG752 , GG753 and T9587, which 
all carry the recurrent t(3;8), and from tumor GG682, which 
carries an ins(8;3) (ql2;p21.3pl4.1) . RNA from tumor CG580, 
which carries a t(8;15) , was included as a negative control, 
PGR experiments resulted in the generation of PGR products 

35 corresponding to hybrid transcripts consisting of PLAGl and 
CTNNBl secfuences, in seven out of seven t(3;8) tumors analy- 
zed (Fig. 6). In tumors GG368, CG588, GG682, GG752, and 
T9587, PGR products of 509 bp (Fig. 6A, PGR product A) and 
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614 bp (Fig. 6A, PCR product B) were generated, whereas in 
tumors CG644 and CG753, only the PCR product of 509 bp was 
found. The PCR product of 509 bp (from NECAT-UP up to MV6) 
is corresponds to a hybrid transcript containing exon 1 of 
5 CTNiipi and exons 3 to 5 of PLAGi . The PCR product of 605 bp 
contains an extra 105 bp, which corresponds to the alterna- 
tively spliced exon 2 of PIAGl . it points towards the pre- 
sence of a related isoform consisting of exon l of CTNNBi 
and exons 2 to 5 of PLAGj. This was also confirmed by nucle- 
10 otide sequence analysis of the PCR products. We were also 
able to demonstrate that the corresponding reciprocal fusion 
transcripts are expressed, m all tumors except CG682, a PCR 
product of 13 0 bp was generated corresponding to a fusion 
transcript consisting of exon 1 of PLAGI and exons 2 to 16 
15 of CTNNBI . In tumor CG753, an additional PCR product was 

detected, corresponding to a fusion transcript consisting of 
exons 1 to 2 of pi^GJ, and exons 2 to 16 of CTNNBi . This 
additional band was also observed but with weak intensity in 
tumor CG644. All these results indicate that in CG644 and 
20 CG753, the 8ql2 translocation breakpoints are located in 
intron 2, whereas the breakpoints of tumors CG368, CG588, 
CG682, CG752, and T9587 are located in intron 1. Interestin- 
gly, using 5* RACE analysis with the tumor CG580, which 
carries a t(8;l5) translocation, we found that the break- 
25 point occurs also in the same region leading to a fusion 
transcript with ectopic fused to exon 3 of PLAGI . 
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Activation of ELhSl expression due to promoter swapping 
Northern blot analysis was performed to evaluate the 
effect of the t (3 ;8) (p21;ql2) translocation on the expressi- 
on levels of ELAfil and CTNNfil (Fig. 7) . As previously menti- 
oned, PLAGI is expressed as a 7.5 kb transcript, which was 
readily detected in human fetal lung, liver, and kidney but 
not in fetal brain. In adult tissues, the 7.5 kb transcript 
35 was not detected in human heart, brain, lung, liver, skele- 
tal muscle, kidney, pancreas, and salivary gland. Low levels 
of PLAQl expression was found in human placenta (Fig. 7A) . 
In contrast, the PTWHBl gene was ubiquitously expressed as a 
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3.8 kb RNA doublet in all tissues tested (Fig, 7B) . Similar 
results were obtained in the Northern evaluation of fetal 
and adult mouse tissues (data not shown) . 

Northern blot analysis was performed on two pleomorphic 
5 adenomas with t(3;8) and on one tumor with a variant t(8;15) 
(Fig. 7C) . In contrast to normal salivary gland tissue in 
which no PLAGl transcripts could be detected, the three 
tumors expressed a 7.5 kb transcript. This transcript does 
not correspond to the normal PLAGl transcript (exons 1 to 

10 5) f since an exon 1-specif ic probe failed to hybridize to 
this transcript (data not shown) . Instead, a probe specific 
for exon 1 of CTNNBl recognized this 7.5 kb transcript as 
well as the 3.8 kb doublet of CTNNBl transcripts in the two 
tumors with a t(3;8). In contrast, only the 3 . 8 kb doublet 

15 of CTNNBl transcripts was detected in adenoma CG580 with a 
variant t(8;15). The expression level of CTNNBl in this 
tximor was about twofold higher as compared to tumors CG644 
and CG682, which is expected since none of the CTNNBl alle- 
les is rearranged. Collectively, these results suggest that 

20 PLAGl expression in these pleomorphic adenomas is driven by 
the 5» regulatory sequences either of the CTNNBl gene for 
the cases with t(3;8) or of an as yet unknown gene on chro- 
mosome 15 for the case with t(8;15). These results indicate 
that the principal mechanism in pleomorphic adenomas with 

25 8ql2 rearrangements is activation of PLAGl expression, due 
to promoter swapping between PLAGl and a translocation 
partner gene, preferentially CTNNBl . Furthermore, expression 
of the translocation partner gene is likely to be down- 
regulated concomitantly; e.g. as demonstrated for CTNNBl. 

30 

Di^cqsgjQp 

We have characterized a novel gene, PLAGl, which we 
propose is a critical locus involved in the development of 
pleomorphic adenoma of the salivary glands. The gene was 
35 identified on the long arm of chromosome 8, close to the MPS 
proto-oncogene, as a result of a positional cloning project 
to molecularly define the t (3 ;8) (p21;ql2) , which is the most 
frequent chromosome aberration in pleomorphic adenomas. We 
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have also identified the translocation partner gene at 3p21, 
i.e. the STNHBi gene which encodes B-catenin. In addition to 
8ql2 and 3p21, chromosome rearrangements involving I2ql3-15 
are also frequently observed in pleomorphic adenomas. The 
5 high mobility group protein gene HMGIC was recently identi- 
fied as the gene consistently rearranged in these cases 
(Schoenmakers et al. (1995)). In conclusion, we have 
molecular ly defined the three main chromosomal 
rearrangements implicated in the genesis of pleomorphic 
10 adenomas. 

Structural rearrangement of the 8qli-i3 region are 
consistently found also in benign lipoblastomas. These 
tumors are considered to be unique variants of lipoma, 
occurring primarily in children before 3 years of age. Among 

15 the seven cases so far analyzed cytogenetically, rearrangem- 
ents involving 8qll-i3 were observed in six cases (Mitelman 
(1994)). Similar to pleomorphic adenomas, the translocation 
partners of 8ql2 are variable. The similarities in 
chromosomal pattern between these two tumor types suggest 

20 that EMGX is the target of the 8qii-i3 rearrangements also 
in lipoblastomas. 

We have also elucidated the molecular consequences of 
the t(3;8) translocation. The 8qi2 translocation breakpoints 
were invariably found in the 5 ' -non-coding region of plagi 

25 in all seven primary tumors tested. The breakpoints in the 
CTNHai gene were always found in the first intron. As a 
result of the translocation, the coding sequences of plagi 
are brought under control of the 5« regulatory sequences of 
the CliaiBl gene, and vice versa, resulting in the activation 

30 of PLAG^ and down-regulation of CTNNBi . Based on their 

expression patterns in normal fetal and adult tissues, it is 
clear that the regulation of expression of the two genes is 
very different. The CTywpi gene is highly and ubiquitously 
expressed, whereas PIAGI is developmentally regulated, with 

3 5 expression restricted to fetal tissues. It should be empha- 
sized that in adult tissues, PLAGI is either not expressed 
or expressed at very low levels. In the adenomas, both the 
CTfTNBJYPT,AGl and the reciprocal PLAGI /ctwmbi transcripts 
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were detected by RT-PCR; detection of the latter transcripts 
sometixnes required three rounds of PGR, indicating that the 
expression levels are very low. Our findings represent the 
first example of reciprocal exchange of expression control 
5 elements in solid tumors. Since the coding sequences of both 
genes are invariably preserved, the molecular mechanism 
could be classified as promoter swapping, resulting in 
activation of PLAGl and down-regulation of CTNNBl 
expression. 

10 Since the t(3;8) in pleomorphic adenomas invariably 

leads to activation of PLAGl , it is of interest to evaluate 
possible physiological implications. The PIAGl gene encodes 
a protein with a deduced molecular weight of 56 kDa and a 
deduced pi of 8.56. Analysis of the open reading frame of 

15 PLAGl reveals seven zinc fingers in the amino-terminal 

region. The carboxy-terminal region is rich in serine resi- 
dues. Furthermore, two potential nuclear localization sig- 
nals are present (residues 22-25 and 29-32). Collectively, 
this suggests that the PLAGl protein is a novel member of 

20 the large zinc finger gene family. Zinc finger motifs were 
originally identified as DNA binding structures in the RNA 
polymerase III transcription factor TFIIIA, which binds to 
the internal control region of the 5S RNA gene. TFIIIA- like 
zinc fingers are present in a variety of regulatory proteins 

25 found in higher and lower eukaryotes. This type of zinc 

finger motif consists of «30 amino acids with two cysteine 
and two histidine residues (C2H2) that stabilise the domain 
by tetrahedrally coordinating a Zn^* ion. A region of «12 
amino acids between the invariant cysteine-histidine pairs 

30 is characterized by scattered basic residues and several 
conserved hydrophobic residues. Most zinc finger genes 
studied so far encode polymerase II transcription factors. 
Apart from transcriptional modulation and control of RNA 
metabolism, chromatin packaging might also constitute an 

35 important activity through which zinc finger proteins exert 
their regulatory roles (El-Baradi & Pieler (1991) )• It 
should also be noted that the presumed role of HMGIC is also 
in chromatin modelling (Schoenmakers et al. (1995); Ashar et 
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al. (1995); Wolffe (1994)). The mammalian genome is known to 
contain a large number of C2H2 zinc finger genes (Bellefroid 
et al. (1989)) and the number of such genes implicated in 
cancer is growing steadily. 
5 The function of the serine rich carboxy-terminal part 

of PLAgl is unknown but it may have a regulatory function 
that can be controlled by serine/threonine kinases. If plagi 
encodes a DNA binding protein, as the presence of its zinc 
fingers suggests, the carboxy-terminal region might 

10 represent the transactivation domain. At least four 

different primary sequence motifs that characterize the 
activation domains are identified thus far, i.e. acidic, 
glutamine-rich, proline-rich and serine/threonine-rich. They 
likely represent regions that functionally interact with 

15 other proteins (Mitchell & Tjian (1989)). 

Activation of ELAfil due to promoter substitution may 
lead to deregulation of genes normally controlled by plagi . 
If overexpression of PLAGI is indeed a major molecular 
consequence of the 8ql2 translocations than this could 

20 possibly be achieved also by other mechanisms, including for 
instance an increase in gene copy number. In this context it 
should be noted that a small subgroup of pleomorphic adeno- 
mas show trisomy 8, often as the sole anomaly (Mitelman 
(1994)). In addition, trisomy 8 is a common numerical 

25 abnormality found in several histological subtypes of 
malignant salivary gland tumors as well as in different 
subtypes of leukemias and sarcomas (Mitelman (1994)). 

The preferential fusion partner of PLAGI is the CTNNBl 
gene, which encodes fi-catenin, a cytoplasmic protein of 

30 about 88 kDa (Gumbiner & HcCrea (1996)). Its frequent 
involvement in pleomorphic adenomas as found here might 
point towards a critical role of CTNNBl . Most likely, a role 
of SHiHSl is to provide an active promoter in front of the 
EL&Gl gene. However, since the observed promoter swapping 

35 between £L&6 and CTNNBl leads to down-regulation of CTNNBl 
expression, it may also lead to other physiological 
consequences, especially since B-catenin is a broad-range 
protein interface, as that has been implicated in highly 
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diverse processes, such as human colon cancer, epithelial 
cell adhesion, embryonal axis formation in XenoDUs . and 
pattern formation in Drosophila (Peifer (1993)). The S- 
catenin protein has also been found as structural component 
5 of adherens junctions (AJs) (Peifer (1993); Kemler (1993); 
Gumbiner (1996)), which are multiprotein complexes assembled 
around Ca^*-regulated cell adhesion molecules (cadherins) • 
B-Catenin binds directly to cadherins and acts as a protein 
interface between cadherin and the cytoskeleton. The 

10 cadherin-6-catenin complex mediates cell adhesion, 

cytoskeletal anchoring, and signalling, which are important 
processes for regulation of cell growth and behaviour- It 
has been suggested that AJ complexes may play a role in the 
transmission of signals for contact inhibition (Peifer 

15 (1993)). Down-regulation of B-catenin, as occurs in 

pleomorphic adenomas, could interfere with this transmission 
and lead to growth deregulation. Since B-catenin levels 
apparently are important physiologically, it might be that 
down-regulation of CTNNBl expression plays some role in 

20 pleomorphic adenoma development, possibly synergistically to 
PLAGl , In conclusion, as activation of PLAGl is observed in 
all pleomorphic adenomas with 8ql2 involvement tested, this 
seems to constitute a major factor in salivary gland tumor 
formation. The critical role of down-regulated levels of B- 

25 catenin remains to be established. 

Previous studies of HMGIC have shown that rearrange- 
ments of this gene preferentially occur in particular cyto- 
genetic subgroups of benign mesenchymal tumors, with the 
only exception so far being a subgroup of pleomorphic adeno- 

30 mas (Schoenmakers et al. (1995)). When considering the 

findings in the latter type of tumors, it should be noted 
that, although they are epithelial in origin, they often 
also show a varying degree of mesenchymal differentiation. 
This may be explained by the fact that pleomorphic adenomas 

35 are thought to originate from a pluripotent reserve cell in 
the terminal duct system which possesses the capacity to 
differentiate into both epithelial and myoepithelial cells 
(Evans & Cruickshank (1970)). The latter cells may function 
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as facultative mesenchymal cells, and are responsible for 
the production of extracellular material, including myxo- 
chondroid stroma (Dardick et al. (1991)). An intriguing 
question in this context is of course whether HMGIC is 
5 preferentially affected in adenomas with a prominent stromal 
component while ELA£1 is preferentially affected in tumors 
with little or no stroma. Histological re-examination of all 
but one of the adenomas with PlAGl involvement in this 
study, revealed that all cases contained mesenchymal 

10 components, including both myxoid and chondroid tissues, 
suggesting that mesenchymal differentiation is not 
restricted to adenomas with HMGic involvement. It should be 
noted that previous cytogenetic observations have already 
indicated that chromosome 12ql3-15 and 8ql2 rearrangements 

15 constitute mutually exclusive pathways for the evolution of 
pleomorphic adenomas (Sandros et al. (1990)). 

The identification and cloning of a novel "benign 
oncogene" activated by chromosome translocations constitute 
a major step towards an increased understanding of the 

20 molecular pathogenesis of benign neoplastic growth. Our 

findings may lead to new insights into the genetic differen- 
ces between benign and malignant tumors. In addition, the 
present findings may have diagnostic implications. The fact 
that PIAga, is not normally expressed in adult tissues, but 

25 activated in pleomorphic adenomas due to 8ql2 rearrange- 
ments, makes it a potentially useful tumor marker in the 
differential diagnosis of benign and malignant salivary 
gland tumors. On the basis of the fusion transcripts that 
are formed, highly specific diagnostic assays can be develo- 

30 ped, as will be illustrated in the following examples. 

FIGURE LEGENDS 
FigiM:« 3 

A: Contig of 3 overlapping YACs (bold lines) , 27 cos- 
35 mids and 2 phages, containing 27 landmarks and spanning a 
300 kb DNA region on chromosome 8ql2 . Contig elements are 
labelled or numbered and defined in the list below. Cosmid 
clones isolated from the arrayed chromosome 8-specific 
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cosmid library constructed at Los Alamos National Laboratory 
(LANL) (Wood et al, (1992)) are named after their unique 
microtiter plate addresses. #1 and #22 are genomic phage 
clones; #21 is a clone isolated from the non-arrayed LANL 
5 chromosome 8-specific cosmid library. The orientation of the 
contig on the long arm of chromosome 8 is given as well as 
the order of 27 landmarks. It should be noted that the 
contig is not scaled. Below the contig, the genomic 
organization and the relative location of the PLAGl gene is 

10 given schematically, with exact sizes (bp) of its exons and 
estimated sizes (kb) of its introns. Noncoding sequences are 
represented as open boxes and coding sequences as black 
boxes. The relative positions of the translation initiation 
(ATG) and stop (TAG) codons in PLAGl are indicated. At the 

15 bottom of the figure, characteristics of the deduced protein 
encoded by PLAGl are given. Zinc fingers are labelled F1-F7 . 



1 « PEM18«- CH33 -^DSSXXX 

2 - ^E9*- PENK -=D8SXXX 
20 3 - 41E12- PENK =DaSXXX 

4= 17B6^EMI72 «D»SXXX 
5- 93B2-EM172 -DSSXXX 

6 « 47DU *-CH280 «D8SXXX 

7 « 53B10*-CH280 »D8SXXX 
2 5 8= 66E5-CH280 »D8SXXX 

9 = 144A7 - CH280 =D8SXXX 

10 =I63D11 -CH280 =D8SXXX 



11 - I63E1 CH280 »D85XXX 

12 - 203B8 ^ CH280 »D8S>OOC 

13 = 50B7*-EM187 =D«SXXX 

14 « I15H9 - EM187 -D8SXXX 

15 «129BI0*-CH129 *D8SXXX 

16 = 116A2 -CHI29 =D8SXXX 

17 » 7B7 - CHI29 « D8SXXX 

18 4H4 EMIS6 D8SXXX 

19 = 4H5 - EMI56 = D8SXXX 

20 = 4H6 *■ EM1S6 := D8SXXX 



21 CEM55 *- EM193 -D8SXXX 

22 a PEM49*- EM317 =D8SXXX 

23 «149GI2 «'D8S285 

24 = 64CI ^ D8S285 

25 " 64D1 ^ D8S285 

26 « 143BI «- MOS - D8SXXX 

27 » I43HI - MOS -D8SXXX 

28 « 148E9 MOS ^ D8SXXX 

29 ^ 22B8 MOS ^ D8SXXX 



(clone number, cosmid or phage clone, STS, accession number) 

30 

B: Mapping of the 8ql2 translocation breakpoint in an 
adenoma (CG644) with a t (3 ; 8) (p21 ;ql2 ) . Cosmid CEM48 (white 
spots) was co-hybridized with an alpha-satellite probes one 
specific for chromosome 3 and the other for chromosome 8 
3 5 (black spots) . Hybridization signals were found on the 
normal 8, the der (3), and the der (8), indicating that 
CEM48 spans the t (3 ;8) (p21;ql2) breakpoint. Chromosomes are 
stained in blue with DAPI. 

40 Figura 4 

A: cDNA and deduced amino acid sequence of PLAGl . The 
relative positions of exon/intron boundaries are indicated 
by triangles (▼) . The conserved C, F, L and H residues in 
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the Zinc finger domains are underlined. Residues 22-25 and 
29-32 constitute two potential nuclear localization signals. 
A potential polyadenylation signal and the putative mRNA 
destabilizing ATTTA motifs in the 3'-noncoding region are 
5 underlined. The nucleotide sequence of the complete cDNA has 
been deposited at GenBank under accession number XXX. 

B: Alignment of the seven zinc-finger-like motifs found 
in the deduced amino acid sequence of PIAGI relative to the 
C2H2 consensus motif. The canonical C, F, L and H residues 
10 are given in bold. 



Figur* 5 

Illustrative example of the detection of DNA rearrange- 
ment in PLAGl of a primary pleomorphic adenoma using Sout- 
15 hern blot analysis. DNA of pleomorphic adenoma CG36ft rn^^noi 



I) and control DNA isolated from normal lymphocytes (panel 

II) were digested with restriction endonuclease BaroHI (B) , 
EcoRI (E), Hindlll (H) , or PstI (P) , as indicated. A probe 
with specificity for exon 3 of PLAGl (EM440) was used. The 

20 molecular weight markers are 23.1, 9.4, 6.5, 4.3, 2.3 and 
2.0 kb, respectively. 

Figure 6 

A: Detection of CTNNBi /PTAm and PLAGl /ctnwri fusion 
25 transcripts by RT-PCR in primary adenomas. I. CTNNBi /PT.An 
were detected using the RT-PCR protocol and primers descri- 
bed in detail in Methods. Primary tumors analyzed included 
CG368 (lane 1), CG588 (lane 2), CG644 (lane 3), CG682 (lane 
4), CG752 (lane 5), CG753 (lane 6), T9587 (lane 7), and 
30 CG580 (lane 8). li. PLAGl /CTNNBI fusion transcripts were 
detected similarly using the same samples as under "1'. 
Details of the primers used here are given in Methods Secti- 
on. PGR products are labelled A-D. 

B: Schematic representation of the nature and origin of 
35 CTjlNBl/pT.;V(?1 and PLAGI/ctwnbi fusion transcripts in primary 
pleomorphic adenomas with t(3;8) (p21;ql2) . At the top of the 
figure, the exon/intron distribution of the PLAGl gene is 
given, at the bottom, the exon/intron distribution for the 
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CTNNBl gene. Positions of chromosome breakpoints are indica- 
ted by an arrow (N ) • Translation initiation sites are indi- 
cated by asterisks (*) and stop codons by triangles (▼) . A- 
D: schematic exon compositions of hybrid transcripts, as 
5 established by 5 •-RACE analysis. A: cDNA sequence junction 
between exon 1 of CTNNBl and exons 3-5 of PLAGl . B: cDNA 
sequence junction between exon 1 of CTNNBl and exons 2-5 of 
PIAGl . C: cDNA sequence junction between exon 1 of PLAGl and 
exons 2*16 of cnflffil. D: cDNA sequence junction between 
10 exons 1-2 of PLAGl and exons 2-16 of CTNNBl . 

Figure 7 

A: Northern blot analysis of the expression pattern of 
PLAGl in normal human fetal tissues including brain (1), 
15 lung (2), liver (3), kidney (4) as well as adult tissues 
including heart (5), brain (6), placenta (7), lung (8), 
liver (9), skeletal muscle (10), kidney (11), and pancreas 
(12) . 

B: Northern blot analysis of the expression pattern of 
20 the CTNNBl gene in normal human fetal and adult tissues as 
described under "A" . 

C: Detection of CTNNBl /PLAGl transcripts in pleomorphic 
adenomas by Northern blot analysis using exon 1 of CTNNBl as 
a molecular probe. Lane 1, RNA of CG644 (t(3;8)), lane 2, 
25 RNA of CG580 (t(8;15)), and lane 3, RNA of CG682 (ins 3p21 
(8ql2)). The CTNNBl /PLAGl fusion transcript is indicated. 

D: Using the same blot as under "C", detection of 
CTNNBl /PLAGl transcripts using a probe with specificity for 
the 3' UTR of PLAGl (probe KK64) . The CTNNBl /PLAGl trans- 
30 cript is indicated. 

EXAMPLE 3 

The discovery of PLAGl made it possible to identify 
ELAGl-related genes. The possibility that the human genome 
35 contains such genes was raised by Southern blot data showing 
bands (weak hybridization signals). In this example, the 
identification of another member of the PLAG gene family 
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that might be of pathogenetlcal relevance is described, i.e. 
the PLAG2 gene. 

To isolate cDNA clones corresponding to other members 
of the ?hhQl gene family, a cDNA library in XZAP was used 
5 (Stratagene) . As molecular probe, the complete human PLAGl 
cDNA was used. Screening of the cDNA library was performed 
according to routine procedures using low stringency hybrid- 
ization conditions (Schoenmakers et al* (1994)). This 
resulted in the isolation of a number of overlapping cDNA 

10 clones presumably corresponding to another member of the 
PLAG family. From the inserts of these, a 2.7 kbp composite 
cDNA could be constructed that contained an open reading 
frame (1233 nucleotides) for a protein (411 amino acids) 
structurally similar to PLAGl. This protein was designated 

15 PLAG2. The 2.7 kbp human PLAG2 cDNA fragment started 17 6 
nucleotides upstream of the ATG start codon. 

The nucleotide sequence of the composite PIiAG2 cDNA and 
the deduced amino acid sequence of the corresponding PLAG2 
protein are shown in Figure 8. Nucleotide sequences were 

20 determined according to the dideoxy chain termination method 
using the T7 polymerase sequencing kit of Pharmacia/LKB . DNA 
fragments, subcloned in the pBLUESCRIPT or pGEM-3Zf (+) 
vector, were sequenced using standard primers and primers 
synthesized based upon newly obtained sequences. Nucleotide 

2 5 sequence data were obtained from both strands and analyzed 

using the sequence analysis computer programs Genepro (Ri- 
verside Scientific), PC/Gene and Intelligenetics (IntelliGe- 
netics, Inc. ) . 

To established that the isolated PLAG2 sequences cor- 

3 0 responded to expressed sequences. Northern blot analysis was 

performed according to routine procedures. Total RNA was 
extracted from various human tissues and cell lines using 
Trizol^** (GIBCO/BRL) . After precipitation, concentrations of 
RNA in the samples were estimated via spectroscopic determi- 
35 nation. 40 /zg of total RNA from each sample was run on 1% 
agarose gels containing l x MOPS (0.02 M MOPS (Sigma) pH 
7.0, 50 mM Na-acetate, 10 mM EDTA pH 8.0), and 0.66 M for- 
maldehyde (Sigma) in a 1 x MOPS buffer for 16 h at 4 V/cro. 
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After gel electrophoresis, RNA was transferred to Hybond N* 
filters (Amersham) using 10 x SSPE (20 x SSPE: 3.0 M NaCl, 
0.16 M NaHjPO^ pH 7.4, 0.02 M EDTA) for standard capillary 
blotting. After completion of the transfers, nylon filters 
5 were irradiated with ultraviolet light using a Stratalinker 
(Stratagene) . Prehybridizations of the blots were carried 
out for 3 h at 42 in hybridization buffer consisting of 5 
X SSPE, 10 X Denhardt's, 2% SDS, 50% formaroide and 100 ^ig/ml 
herring sperm DNA. For hybridizations, blots were incubated 

10 overnight in the same buffer as described for the prehybrid- 
izations. Subsequently, blots were washed for 40 min at room 
temperature in 2 x SSC containing 0.05% SDS and for 4 0 min 
in 0.1 X SSC containing 0.1% SDS at 50 *^C. Autoradiographs 
were obtained by exposing Kodak X-AR film at -80 *»C with 

15 intensifying screen. 

In Northern blot analysis using the complete 2-7 kb 
PLAG2 cDNA as molecular probe and rather stringent hybrid- 
ization conditions, four distinct mRNAs were detected. These 
varied in size and included mRNAs of about 4.4 kb, 3.1 kb, 

20 2.9 kb, and 1.9 kb. The 3.1 and 2.9 kb transcripts were 
preferentially seen in fetal tissue, whereas the other 
transcripts were seen preferentially in adult tissues. These 
results established the expression of the PLAG2 sequences in 
a variety of cell types. Furthermore, the detection of 

2 5 various mRNAs with the PLAG2 cDNA probe suggests the exis- 
tence of various isoforms and raises the possibility that 
various PLAG2 protein isoforms may be produced. 

EXAMPLE 4 

30 Detection of hybrid PLAGl in salivary gland tumor cells. 

cDNA clones of the chromosome 3 -derived CTNNBl gene 
were isolated and the nucleotide sequence thereof establis- 
hed. The nucleotide sequence data of a composite cDNA are 
shown in Fig* 9. The amino acid sequence of the CTNNBl - 
35 encoded protein was deduced. 

In 5 '-RACE analysis of primary pleomorphic adenomas of 
the salivary glands, PIAGl 
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containing fusion trancripts were identified. 5 •-RACE expe- 
riments were performed using total RNA of the primary tu- 
mors, A primer specific for exon 5 (MV2) was used in the 
cDNA synthesis. Following ligation of an adaptor to the 
5 CDNA, PGR amplification was performed using, in the first 
round, an adaptor-specific primer and a PLAGl-specif ic 
primer corresponding to sequences of exon 5 (MVS) and, in a 
second round, nested adaptor-specific primer and PLAGl - 
speclfic primer corresponding to sequences of exon 4 (MV6) . 

10 Nucleotide sequence analysis of the PGR product revealed 

that the ectopic sequences were fused to the acceptor splice 
site of exon 3 of EL^fil. BLAST analysis revealed that they 
were identical to exon 1 sequences of CTNNBl . the gene for 
6-catenin, which has previously been assigned to chromosome 

15 3p21, 

EXAMPLE 5 

Diagnostic test for pleomorphic adenoma of the salivary 
glands 

20 Fine needle biopsies of patients having a pleomorphic 

adenoma of the salivary gland (with a chromosome 8ql2 
aberration others with a normal karyotype) were taken. From 
the material thus obtained total RNA was extracted using the 
standard TRIZOL^^ LS protocol from GIBCO BRL as described in 

25 the manual of the manufacturer. This total RNA was used to 
prepare the first strand of cDNA using reverse transcriptase 
(GIBCO/BRL) and an oligo dT(17) primer containing an 
attached short additional nucleotide stretch. The sequence 
of the primer used is as described in Example 2, under point 

30 2.6 [AAG GAT CCG TCG ACA TC(T)17]. RNase H was subsequently 
used to remove the RNA from the synthesized DNA/RNA hybrid 
molecule. PGR was performed using a gene-specific primer 
(Example 2, point 2.6) and a primer complementary to the 
attached short additional nucleotide stretch. The thus 

3 5 obtained PGR product was analysed by gel electrophoresis. 
Fusion constructs were detected by comparing them with the 
background bands of normal cells of the same individual. 
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In an additional experiment, a second round of hemi- 
nested PGR was performed using one internal primer and the 
primer complementary to the short nucleotide stretch [AAG 
GAT CCG TCG ACA TC(T)17]. The sensitivity of the test was 
5 thus significantly improved. 



EXAMPLE 6 

Diagnostic test for uterine leiomyoma. 
10 Uterine leiomyoma is the most common pelvic tumor in 

women, occurring with an incidence up to 77% of women of 
reproductive age when tumors are counted after 2 millimeter 
serial sectioning of consecutive hysterectomy specimens. 
Often multiple leiomyomas are present, with estimates as 
15 high as an average of 6.5 tumors per uterus. Although most 
patients with these steroid-dependent tumors are asymptoma- 
tic, symptomatic leiomyomas can be associated with abnormal 
uterine bleeding, pelvic pain, urinary dysfunction, sponta- 
neous abortions, premature delivery and infertility. The 
20 high incidence of this benign smooth muscle tumor constitu- 
tes a major public health problem with the diagnosis of 
myomatous uterus leading to over 200,000 hysterectomies 
performed annually in the United States. Furthermore, non — 
random inactivation of the X chromosome has clearly demon- 
25 strated that uterine leiomyomas are monoclonal proliferati- 
ons. Although leiomyomata typically are benign tumors, in a 
very low percentage (< 0,1%) they might be able to metasta- 
size or even progress to malignancy. 

Besides a normal karyotype which is being found in 
3 0 approximately 70% of the cases investigated, several cytoge- 
netically abnormal subgroups, can be distinguished among 
leiomyomata. Leaving out of consideration the group which 
shows random changes, one of the largest cytogenetic sub- 
groups (comprising approximately 2 5% of the cytogenetically 
35 abnormal tumors) was first described in 1988 and is charac- 
terized by the involvement of 12ql5 and/or 14q23-24, mainly 
as t(12;14) (ql4-15;q23-24) . Another subgroup, with a similar 
incidence, contains deletions involving the long arm of 
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chromosome 1, with region q21-22 being the most probable 
commonly involved region. Another subset of leiomyomas is 
characterized by numerical aberrations, mainly trisomy 12. 
This trisomy is found in approximately 10% of the cytogene- 
5 tically abnormal leiomyomas. Furthermore, chromosome 6p21~ 
pter, has been found to be recurrently involved in roughly 
5% of the cases studied. Finally, a small percentage (ap- 
prox. 3.5%) of leiomyomata shows t(l;2) (p36;p24) . In uterine 
leiomyomata with chromosome 12ql4-15 aberrations, the HMGIC 

10 gene is affected and in those with chromosome 6p aberrati- 
ons, the IfllSix gene is affected. 

The observation that 70% of the uterine leiomyomata 
tested show a normal karyotype prompted studies to investi- 
gate whether activation of a PLAg gene could be found in 

15 uterine tumors. Therefore, Northern blot and PGR analysis 
were performed. 

Specimens of tumor material surgically removed from 
patients and classified by routine pathology as uterine 
leiomyoma, uterine leiomyosarcoma, fibro leiomyoma, or uteri- 

20 ne fibroma were used. From the material thus obtained, total 
RNA was extracted using the standard TRIZOL™ LS protocol 
from GIBCO BRL as described in the manual of the manufactu- 
rer. This total RNA was used in Northern blot analysis, 
which revealed activation of in about 50% of the 

25 tumors tested, indicating that specific PIAG activation is 
also observed in these types of tumors. These results were 
confirmed by PGR analysis. These results reveal the possibi- 
lities of using fLAfil as a diagnostic marker on routine 
biopsies taken from patients. Since ELAfil is implicated in 

30 this type of tumor, EL&Gl can be used as a target for the 
development of novel therapeutic strategies for uterine 
leiomyoma. 

EXAMPLE 7 

35 Diagnosis of lipoblastoma. 

Lipoblastomas are benign pediatric tumors that are 
assumed to result from proliferation of primitive adipocy- 
tes. These type of tumors often grow rapidly but there are 
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no reports that they metastasize. Because of the rapid 
growth and their histopathological features, lipoblastomas 
can mimic myxoid or well-differentiated liposarcoma which 
makes the diagnosis problematic. As a direct consequence, 
5 therefore, some of these tumors have been treated in an 
unnecessarily agressive manner, 

A recurrent rearrangement involving the long arm of 
chromosome 8 has been reported for lipoblastomas- Southern 
blot analysis detected a rearrangement in the PlAgi gene 
10 opening the possibility of using PIAGl as a diagnostic 
marker . 

A biopsy of a patient having a lipoblastoma was taken. 
From the material thus obtained total RNA was extracted 
using the standard TRIZOL^" LS protocol from GIBCO BRL as 

15 described in the manual of the manufacturer. This total RNA 
was used to prepare the first strand of cDNA using reverse 
transcriptase (GIBCO/BRL) and an oligo dT(17) primer contai- 
ning an attached short additional nucleotide stretch. The 
sequence of the primer used is as described in Example 2, 

20 under point 2.6. RNase H was subsequently used to remove the 
RNA from the synthesized DNA/RNA hybrid molecule. PGR was 
performed using a gene-specific primer (Example 2, point 
2.6) and a primer complementary to the attached short addi- 
tional nucleotide stretch. The thus obtained PGR product was 

25 analysed by gel electrophoresis. Fusion constructs were 
detected by comparing them with the background bands of 
normal cells of the same individual. 

In an additional experiment, a second round of hemi- 
nested PCR was performed using one internal primer and the 

30 primer complementary to the short nucleotide stretch. The 
sensitivity of the test was thus significantly improved. 

EXAMPLE 8 

PLAG2 as diagnostic marker. 
35 To evaluate the diagnostic potential of the PIAG2 gene, 

we have first identified on which human chromosome the PLAG2 
gene is located and subsequently established its subchromo- 
somal location. Such mapping studies could link the gene to 
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known tumor-specific chromosome anomalies. Using several 
different PLAG2 containing YACs and cosmids in FISH analy- 
sis, the £LaSi2 gene was tentatively mapped to chromosome 
band 6q24, a chromosome segment implicated in various tumors 
5 such as for instance malignant salivary gland tumors. 

Malignant salivary gland tumors are a heterogeneous 
group of tumors comprising at least 17 different entities 
according to the most recent WHO classification (1991) . 
Because of the large number of tumor types and the wide 

10 morphological spectrum that these tumors show they constitu- 
te a well recognized diagnostic problem. The most frequent 
chromosome abnormalities so far observed among malignant 
salivary gland tumors are deletions or translocations invol- 
ving 6q. Most of the deletions so far encountered are 

15 terminal deletions with breakpoints located at 6q22-q25. 

However, there are also a few cases on record with intersti- 
al 6q deletions. The minimal common region lost in the 
majority of all cases is 6q24-qter. The 6q deletions are not 
restricted to certain types of malignant salivary gland 

20 tumors, but seem to occur in all major histologic subtypes, 
including adenoid cystic carcinoma, adenocarcinoma, undiffe- 
rentiated carcinoma, mucoepidermoid carcinoma and acinic 
cell carcinoma. The frequency of 6q rearrangements vary 
somewhat in different tumor types. In for example adenoid 

25 cystic carcinomas clonal 6q deletions or translocations have 
been found in nearly 50 % of the cases. The fact that the 6q 
deletions are often the sole karyotypic change, indicate 
that they are primary cytogenetic events of pathogenetic 
importance to malignant salivary gland tumors. In addition 

30 to 6q deletions a subgroup of adenoid cystic carcinomas are 
characterized by a t(6;9) (q21-24;pl3-23) . This is a primary 
abnormality that is diagnostic for this tumor types. it 
should also be noted that in addition to malignant salivary 
gland tumors there are several other tumor types with recur- 

3 5 rent deletions of 6q, including e.g. various types of leuke- 
mias and lymphomas, malignant melanomas and neuroblastomas. 



EXAMPLE 9 
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involvement of EMSZ in malignant salivary gland tumors 

This example is performed to test whether the different 
6q deletions commonly seen in different histologic subtypes 
of malignant salivary gland tumors involve the PMG2 gene at 
5 6g24. 

8.1 Ma^-erial ^*pd methods 

FISH analysis using the PLAG2 containing CEPH mega-YACs 
798D8 and 921C2 was performed on metaphase chromosomes from 
10 two cases of malignant salivary gland tumors, one 

adenocarcinoma and one adenoid cystic carcinoma, with 6q 
deletions. The methods for cytogenetic analysis and FISH 
were the same as described for PIAGl. 



15 8.2 Results 

Hybridization with both YACs on metaphases from the 
adenocarcinoma resulted in signals of the same intensity on 
both chromosome 6 homologs at the expected position. There 
was no evidence of deletion of PIAG2 signals in this case. 
20 In contrast, the adenoid cystic carcinoma contained a 

subpopulation of cells, which in addition to the signal from 
the normal chromosome 6 showed either no signal or a weak 
signal from the 6q-marker chromosome. Both YACs gave a 
similar hybridization pattern. 

25 

8.3 Conclusion 

Preliminary studies suggest that at least one of the 
breakpoints in the 6q deletions in malignant salivary gland 
tumors map in the vicinity of the ELAG2 locus. PLAG? is 
30 therefore a candidate gene for the commonly ocurring 6q 
deletions in malignant salivary gland tumors. 



EXAMPLE 10 

Animal tumor models involving PLAGl as tools in in vivo 
35 therapeutic drug testing 

on the basis of the acquired PLAGl knowledge, animal 
tumor models can be developed as tools for in vivQ therapeu- 
tic drug testing. To achieve this objective (for instance 
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for pleomorphic adenoma of the salivary gland), two approa- 
ches can be used, gene transfer (generation of transgenic 
animals) on the one hand and gene targeting technology 
(mimicking in VlVo of a specific genetic aberration via 
5 homologous recombination in embryonic stem cells (ES cells)) 
on the other. These technologies allow manipulation of the 
genetic constitution of complex living systems in specific 
and pre-designed ways. For extensive technical details, see 
B. Hogan, R. Beddington, F. Constantini, and E. Lacy; In: 
10 Manipulating the mouse embryo, a laboratory manual. Cold 
Spring Harbor Press, 1994; ISBN 0-87969-384-3. 

To aim at the mutation of the PLAGi gene, specifically 
in selected cell types and selected moments in time, the 
recently described Cre/LoxP system can be used (Gu, H. et 
15 ^ Deletion of a DNA polymerase 6 gene segment in T cells 
using cell type-specific gene targeting. Soienea 265, 103- 
106, 1994). The ere enzyme is a recombinase from bacteriop- 
hage PI whose physiological role is to separate phage geno- 
mes that become joined to one another during infection. To 
20 achieve so, Cre lines up short sequences of phage DNA, 

called loxP sites and removes the DNA between them, leaving 
one loxP site behind. This system has now been shown to be 
effective in mammalian cells in excising at high efficiency 
chromosomal DNA. Tissue-specific inactivation or mutation of 
25 a gene using this system can be obtained via tissue-specific 
expression of the Cre enzyme. 

As an example, the development of animal model systems 
for pleomorphic adenoma of the salivary glands using a 
member of the fLftS gene family will be outlined below, such 
30 that the models will be instrumental in in vivo testing of 
therapeutic drugs. 

Two approaches will be followed: 

a) Xn vivQ induction of specific genetic aberrations as 
observed in human patients ((conditional) gene (isogenic) 

35 targeting approach) ; and 

b) introduction of DNA constructs representative for the 
genetic aberrations observed in patients (gene transfer 
approach) . 
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DNA constructs to be used in gene transfer will be 
generated on the basis of observations made in patients 
suffering from pleomorphic adenomas of the salivary glands 
as far as structure and expression control are concerned; 
5 e.g. EL^ fusion genes with various translocation partner 
genes, especially the preferential translocation partner 
gene CUfflBi of chromosome 3, i.e. complete ELaSil under 
control of a strong (salvary gland-specific) promoter. 

10 EXAMPLE 11 

The preparation of antibodies against EIAfi-encoded proteins 

one type of suitable molecules for use in diagnosis and 
therapy are antibodies directed against the EMS genes. Two 
approaches have been followed to develop them. Based upon 
15 the nucleotide sequence of the ELAG cDNAs, computer analysis 
was used to predict the location of antigenic determinants 
within the proteins. Synthetic peptides containing such 
determinants were prepared and rabbits were immunized with 
these. As an alternative approach, cDNA sequences can be 
20 expressed in appropriate pro- and eukaryotic expression 

systems and the polypeptides synthesized in this way can be 
purified and used for immunization of mice. Cell lines 
obtained upon transfection or electroporation of constructs 
of the PIAG genes were helpful in characterizing the antibo- 

25 dies obtained. 

For the preparation of rabbit polyclonal antibodies 
directed against the EL^-encoded proteins, use was made of 
the following three commercially available peptides: 
(H-DLSEVRDTQKVPSGKR) 8-Multiple Antigen Peptide 
30 (H-FSSTSYAISIPEKEQPL) 8-MAP 
(H-QUPTQTQDIiQDP) 8-MAP 

obtainable from Research Genetics Inc., Huntsville, AL, USA. 
The polyclonal antibodies were made according to standard 
techniques . 

35 With respect to PIAG2 . suitable antibodies were genera- 

ted using hybrid proteins as antigens. These contained 
various portions of the PIAG2 coding region fused in frame 
to GST (glutathion-S-transferase) . 
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BX3VNPLB 12 

Inhibition of SLhSl expression in tumor cells using antisen- 
se constructs. 

5 Introdut^j-jtrin 

£LhSl can be used as target in novel therapeutic proto- 
cols for tumors in which it is implicated; e.g. pleomorphic 
adenomas of the salivary glands, uterine leiomyoma, etc. 
cancer as a genetic disease is a logical target for gene 
10 therapy, either by replacing the missing or inactivated gene 
or by suppressing the activity of an unwanted gene. The high 
affinity of short DNA sequences for their target mRNA has 
indicated that antisense oligonucleotides constitute valua- 
ble reagents to specifically suppress the production of gene 

15 products, both in vit:ro and in vivn . ADDlica*:^ k»„. u 

documented in various biomedical research areas, such as for 
instance cancer research, virology, and cardiovascular 
research. The in vitro activity, toxicity, and pharmacok- 
inetic data of antisense oligonucleotides are encouraging 
20 and in vivo animal experiments demonstrating suppression of 
neointimal formation seem very promising. Also in the field 
of cancer research promising results have been obtained 
However, potential nonspecific effects of antisense oligonu- 
cleotides Should be carefully considered when antisense 
25 agents are used to define biological functions of specific 
genes . 

Since expression of the gjLAGl gene is frequently acti- 
vated or strongly elevated in a wide variety of tumors and 
tumor cell lines, derived from tumors (rhabdomyosarcoma, 

30 uterine leiomyosarcoma, uterine leiomyoma, malignant 

salivary gland tumors) as well as leukemias and lymphomas. 
It was speculated that the EM£l-encoded protein might play 
a key role in transformation of cells. This example shows 
that expression of the PLAGl gene can be strongly reduced by 

35 expressing antisense PLAgl sequences and that reduction of 
mSi levels in tumor cells results in reversion of the 
transformed phenotype. Thus the expression or administration 
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of antisense molecules can be successfully applied 
ther apeut ica 1 ly . 

2 , Materials and m ethods 
5 2.1 Tumor cell lines 

Tumor cell lines were generated from primary tumors as 
described by Kazmierczak et al. . Genes Chromosom. Cancer 
5:35-39, 1992. 

10 2,2 culturing tumor cells 

Tumor cells were propagated in TC199 culture medium 
with Earless salts, supplemented with 2 0% fetal bovine serum 
(GIBCO) , 200 lU/ml penicillin, and 200 microgram/ml strepto- 
mycin - 

15 

2.3 Transfection assay 

Transfections were performed using various protocols, 

namely: 

!• The calcium phosphate precipitation procedure of Graham 
20 and Van der Eb ("A new technique for the assay of the infec- 
tivity of human adenovirus." Virology 52: 456-467, 1973.). 
2. Lipofection: Transfections were carried out using liposo- 
me-mediated DNA transfer (lipof ectamine, GIBCO/BRL) accor- 
ding to the guidelines of the manufacturer. 

25 

2.4 Antisense constructs 

Sense and antisense constructs of the PLAGl gene were 
obtained by inserting human PLAGl cDNA sequences in both the 
sense and antisense orientation in expression vectors under 

3 0 transcriptional control of various promoter contexts, e.g. 
the long terminal repeat of Moloney murine leukemia virus, a 
CMV promoter, or the early promoter of SV40. For example, 
the CMV/ PLAGl plasmid was constructed by cloning a human 
PLAGl cDNA fragment containing all coding sequences of human 

35 PLAGl in pRC/CMV (Invitrogen) allowing expression under 
control of the human cytomegalovirus early promoter and 
enhancer, and selection for G418 resistance. 
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3j. Results 

3.1 Inhibition of PlAGl expression 

Inhibition of EiAfil expression was observed in tumor 
cells after induction of antisense £L&Si expression in these 
tumor cells. Immuncprecipitation and Western blot analysis 
indicated a strong reduction of PLAGI protein levels in the 
cells expressing antisense £^tSl sequences. Therefore, this 
approach can be used therapeutically in tumors with involve- 
ment with PIAG^ . 
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CLAIMS 



1. Gene being in an isolated form and implicated 
in tumorigenesis having the nubleotide sequence of any one 
of the strands of any one of the members of the FLAG subfa- 
mily of zinc finger protein genes or CTNNB^ genes, including 

5 modified versions thereof. 

2. Tumorigenesis gene (T-gene) as claimed in 
claim 1 having at least homology with the zinc finger 
domains that are typical for the pt^G; gene the nucleotide 
sequence of which is depicted in figure 4A, or the 

10 complementary strand thereof, including modified or 
elongated versions of both strands. 

3. T-gene as claimed in claim 1 or 2 having essen- 
tially the nucleotide seauence of th*» PT.am ^ _^ . 

— '>|w.i»i^ us> vjcj^x^icea 

in figure 4A, or the complementary strand thereof, including 
15 modified or elongated versions of both strands. 

4. T-gene as claimed in claim 1 or 2 having essen- 
tially the nucleotide sequence of the PLAG2 gene as depicted 
in figure 8A, or the complementary strand thereof, including 
modified or elongated versions of both strands. 

^° ^* T-gene as claimed in claim 1 having essentially 

the nucleotide sequence of the CTNNBl gene as depicted in 
figure 9, or the complementary strand thereof, including 
modified or elongated versions of both strands. 

6. T-gene as claimed in claims 1-5 for use as a 

2 5 starting point for designing suitable expression-modulating 
compounds or techniques for the treatment of non-physiolo- 
gical proliferation phenomena in human or animal. 

7. T-gene as claimed in any one of the claims 1-6 
for use as a starting point for designing suitable nucleo- 

30 tide probes for (clinically/medically) diagnosing cells 

having a non-physiological proliferative capacity as compa- 
red to wildtype cells. 

8. T-gene as claimed in claims 1-7, which gene is 
implicated in tumors selected from rhabdomyosarcoma, uterine 

35 leiomyosarcoma, uterine leiomyoma, malignant salivary gland 
tumors, leukemias and lymphomas. 
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9. Protein encoded by a T-gene as claimed in 
claims 1-8 for use as a starting point for preparing suit- 
able antibodies for (clinically/medically) diagnosing cells 
having a non-physiological proliferative capacity as compa- 

5 red to wildtype cells. 

10. Derivatives of the T-gene as claimed in 
claim 1-8 for use in diagnosis and the preparation of thera- 
peutical compositions, wherein the derivatives are selected 
from the group consisting of sense and anti-sense cDNA or 

10 fragments thereof, transcripts of the gene or practically 
usable fragments thereof, fragments of the gene or its 
complementary strand, proteins encoded by the gene or frag- 
ments thereof, antibodies directed to the gene, the cDNA, 
the transcript, the protein or the fragments thereof, as 

15 well as antibody fragments. 

11. In situ diagnostic method for diagnosing 
interphase and/or metaphase cells having a non-physiological 
proliferative capacity, comprising at least some of the 
following steps: 

20 a) designing a set of nucleotide probes based on 

the information obtainable from the nucleotide sequence of 
the T-gene as claimed in claim 1-5, wherein at least one of 
the probes is hybridisable to a region of the aberrant gene 
substantially mapping at the same locus as a corresponding 

25 region of the wildtype gene and /or the same or another probe 
is hybridisable to a region of the aberrant gene mapping at 
a different locus than a corresponding region of the 
wildtype gene; 

b) incubating one or more interphase or metaphase 
30 chromosomes or interphase or metaphase cells having a non- 
physiological proliferative capacity, with the probe (s) 
under hybridising conditions; and 

c) visualising the hybridisation between the 
probe (s) and the gene. 

35 12. Method of diagnosing cells having a non-physi- 

ological proliferative capacity, comprising at least some of 
the following steps: 
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a) taking a biopsy of a tumor to obtain cells to 
be diagnosed; 

b) isolating a suitable T-gene-related macromo- 
lecule therefrom; 

5 c) analysing the macroxnolecule thus obtained by 

comparison with a wildtype reference molecule preferably 
from the same individual. 

13. Method as claimed in claim 12, comprising at 
least some of the following steps: 

^) taking a biopsy of a tumor to obtain cells to 
be diagnosed; 

b) extracting total RNA thereof; 

c) preparing at least one first strand cDNA of the 
mRNA species in the total RNA extract, which cDNA comprises 

15 a suitable tail; 

d) performing a PCR and/or RT-PCR using a PLAG 
gene specific primer and a tail-specific and/or partner- 
specif /nested primer in order to amplify plag gene specific 
CDNA • s ; 

e) separating the PCR products on a gel to obtain 
a pattern of bands; 

f ) evaluating the presence of aberrant bands by 
comparison to wildtype bands, preferably originating from 
the same individual. 

14. Method as claimed in claim 12, comprising at 
least some of the following steps: 

a) taking a biopsy of a tumor to obtain cells to 
be diagnosed; 

b) isolating total protein therefrom; 

^° separating the total protein on a gel to obtain 

essentially individual bands and optionally trnasferring the 
bands to a Western blot; 

d) hybridising the bands thus obtained with anti- 
bodies directed against a part of the protein encoded by the 

35 remaining part of the T-gene and against a part of the 
protein encoded by the substitution part of the T-gene; 

e) visualising the antigen-antibody reactions and 
establishing the presence of aberrant bands by comparison 



wo 98/07748 




PCT/EP97/04759 



77 

with bands from wildtype proteins, preferably originating 
from the same individual. 

15. Method as claimed in claim 12, comprising at 

least some of the following steps: 
5 a) taking a biopsy of a tumor to obtain cells to 

be diagnosed; 

b) isolating total DNA therefrom; 

c) digesting the DNA with one or more so-called 
"rare cutter" restriction enzymes; 

10 d) separating the digest thus prepared on a gel to 

obtain a separation pattern; 

e) optionally transfering the separation pattern 

to a Southern blot; 

f) hybridising the separation pattern in the gel 
15 or on the blot with one or more informative probes under 

hybridising conditions; 

g) visualising the hybridisations and establishing 
the presence of aberrant bands by comparison to wildtype 
bands, preferably originating from the same individual. 

20 16. Method as claimed in any one of the claims 

10-15, wherein the cells having a non-physiological 
proliferative capacity are selected from the group 
consisting of the tumors pleomorphic adenomas of the 
salivary gland, lipoblastomas, uterine leiomyomas, and other 

25 benign tumors as well as various malignant tumors, including 
but not limited to sarcomas (e.g- rhabdomyosarcoma, 
malignant salivary gland tumors) and leukemias and lympho- 
mas. 

17 • Anti-sense molecules of a T-gene as claimed in 
30 claims 1-8 for use in the treatment of diseases involving 
cells having a non-physiological proliferative capacity by 
modulating the expression of the gene. 

18. Expression inhibitors of the T-gene as claimed 
in claims 1-8 for use in the treatment of diseases involving 

35 cells having a non-physiological proliferative capacity. 

19. Diagnostic kit for performing the method as 
claimed in claim 11, comprising a suitable set of labeled 
nucleotide probes. 
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20. Diagnostic kit for performing the method as 
claimed in claim 12, comprising a suitable set of labeled 
probes . 

21. Diagnostic kit for performing the method as 

5 claimed in claim 13, comprising a suitable set of labeled T- 
gene specific and tail specific PGR primers. 

22. Diagnostic kit for performing the method as 
claimed in claim 15, comprising a suitable set of labeled 
probes, and suitable rare cutting restriction enzymes. 

23. Method for isolating other T-gene based on the 
existence of a fusion gene, fusion transcript or fusion pro- 
tein in a tumor cell by using at least a part of a T-gene 
for designing molecular tools (probes, primers etc.). 

24. T-gene obtainable by the method of claim 23. 

25. T-gene as claimed in claim 24 for use in dia- 
gnostic or therapeutic methods. 
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FIG.4A 



GGCAGCGCAT ACACTACAAT 
TTCCAGGCCC GCCGCGTCCA 
ATTCCGCGGG CGGTGTAGAG 
AGCCCCGCGG TTGCCTCTTG 
ATGCTTCATT CTGTGACGGT 
GAGCAGGGCC TCAGATTGGC 
TCCACGAAGA GTCAATGGGA 
CATCAGTAAT CAGTCCCTTT 
AGAAGCCACC TAAAGCTATA 
GTCC/^CAAG AAGGCCTGGT 
TGATTTGTCA GAAGTAAGAG 
AGCGTGGTGA AACCAAACCA 
AAGGCCTTTA ACAGTGTTGA 
AGGAGAGAGG CCCTACAAGT 
TTTCTAAGTA CAAATTACAA 
ACCCACAAGT GTAATTATTG 
GAAGAATCAC CTCCATACAC 
AAGAATGTGG CAAGAACTAC 
GCCTTGCATG CCGCAACAAG 
AACTTTTGAA AGCACGGGAG 
GCAAGTCGTC TGGTGGGGTT 
GATCGCCGGT TCTACACCCG 
CACTGGAAGA AAGGACTTCC 
GAAAGGATCA CCTGACTCGA 
CTGAAGGTCA AAACAGAACC 
TGTGTCTGTG CCTATAAAAG 
CCAGTGAACT GTTATCAAAG 
TACAACACTC CATTTCAGTC 
GATCACAACT TTACCTTTGG 
TTCATCCCTC TCACCACCTT 
TATGCAATTT CTATTCCTGA 
GAGTTACCTG ATGGAGTTAC 
CTCAAGCATC GTCATCATCT 
CTAGATGATG GTGCAGGAGA 
CAGTGACCCC CTAAACACAC 
TCATACCTTT AAATGGTCCT 
GGAATGAGCT ATTCCCAGGA 
CACACAAACA CAGGATCTTC 
CTCTGCACTC ACTGTCAGCA 
ACCCTCCCAC GTTTCCATCA 
TCATTACAGA AATGTATGTG 
TAGTGCCTAC TTTAAAACAG 
ATTTTCATTA AGCCAGTATA 
GGAACCATTT GTGTTCAGTT 
CTGCCAATTG TCACATTTTA 
ATTATTAGTA GTAAACTTTT 
CTTAAAATTT TCATTTTTGT 
TGGCTCATTA AAAAATTTAG 
TGAATAACTC AAGGGTTTAA 
AATCAAGTGC TTCAAATGAA 
GTTTGTTAGA TACACCATAA 
GTTGTCTGGC TAAAATATTT 
CAAACCCTAA ACACAAAGCA 



GGCTGCTGGA 

GCCCGAAATA 

GCGGCGGACG 

GTGCTGCCTT 

CTATTAATAA 

CAAAATGGGA 

CTGGCTAAGA 

TTGCTTTCTT 

TCATTTAGTG 

TTAGGCTGCG 

ATACCCAGAA 

AGAAAAAACT 

GAAATTAAAG 

GCATACAACA 

AGGCACATGG 

TGAGAAAATG 

ACGACCCTAA 

AATACCAAGC 

TGGTGACCTC 

TGCTTCTGGA 

AAAGAAAAAA 

AAAGGATGTC 

TCTGTCAGTA 

CATATGAAGA 

AGTGGATTTC 

ACGAGCTCCT 

CCATTCACAA 

CATGCAGAGC 

GAATGACATG 

TCTTTCAAAT 

AAAAGAACAG 

AAGGTGGCGT 

AAGCTAGGGT 

CCTCTCCCTA 

CAGCATTGGA 

CCCTATAATC 

AGAAGCACAT 

AGGATCCTGC 

GCTTTCACCA 

AGCTTTTCAG 

TAGCTGTGCC 

TATAAAAATT 

AAATAGAAAC 

AAGTTTACCT 

AGACTTTTTT 

ACAAATCCCA 

CTAATAACAG 

CAATAGAATG 

ATTTTCTTAC 

TTAAGAGTCC 

TTTCAGATCA 

GCCATCTTTA 

TCAGTATTTA 



AAGAGGCGTA 
TGAGAAAAAA 
GGCCGGAGGG 
GGCCGTATTT 
GGTTGCCTTG 
AGGATTGGAT 
TCAAAGTCTG 
TTACGACCAC 
GAGTTGGGCA 
ATGGCCACTG 
AGTCCCTTCA 
TTCCTTGCCA 
GTTCACTCCT 
AGACTGCACC 
CTACTCATTC 
TTTCACCGGA 
CAAAGAGACG 
TTGGATTTAA 
ACCTGTAAGG 
GCACCTTAAA 
AGCACCAGTG 
CGGAGACACA 
TTGTGCACAG 
AGAGTCACAA 
CTTGACCCAT 
TCCGGTGATG 
ACACTTTGCA 
TCGGGATCTG 
CCCAATAGAT 
ATCCGTTCAG 
CCATTAAAGG 
GCCCTCTTCA 
TGGATCCTCA 
TCCAAAAGCT 
TTTTTCTCAG 
CTCTATCAGT 
TCTTCTGTTT 
AAACACTATA 
GCAGTTTAAG 
TAGGATTCTG 
CTAGATGACC 
TCTGCTTTTG 
TAGCTTTTAA 
GGGTATTTTG 
TTTTTCCATA 
TTTTCAAATT 
TGGCTCTACC 
TAAATTGTAT 
TAGCTTCTAA 
AGTTTCGGAA 
GTATATTCTG 
TTATGAGCCT 
TAGCAAAAAG 



AGGAAACAAT 50 
ATTATTAGAA 10 0 
AGGATGTTAA 15 0 
GGCACCCAGA 200 
CTAGAGTTTG 250 
TCCACTCTCT 300 
AGGCTTTTTC 350 
ATGAAACTTG 400 
GTTCCCAAGT 4 50 
TCATTCCTGG 50 0 
GGGAAACGTA 550 
ACTGTGTGAC 600 
ACTCTCACAC 6 50 
AAGGCCTTTG 70 0 
TCCTGAGT^ 750 
AAGATCATCT 8 00 
TTTAAGTGCG 850 
ACGTCACTTG 90 0 
TATGTTTGCA 95 0 
TCTCATGCAG 10 00 
CGAACATTGT 1050 
TGGTGGTGCA 1100 
AGATTTGGGC 10 50 
TCAAGAGCTT 1200 
TTACCTGCAA 12 50 
TCCTTACCTT 13 00 
GTTAAACCTC 13 50 
CCCACCAAAT 14 00 
ATGGACACTG 14 50 
TTCTACCTCA 1500 
GGGAAATTGA 1550 
TCCCAAGATT 16 00 
GATTGGGTCC 16 50 
CTATCTCCAT 1700 
TTGTTTAATT 1750 
GGGGAGCCTT 18 00 
CCCAGCTCCC 1850 
GGGCTTGGGT 1900 
CACAAGTACC 19 50 
GGACATGGAT 2000 
ATTTTTATTT 2050 
TATAATACAA 2100 
ACTGAGCTTT 2150 
TCCTGATTCA 22 0 0 
TAGGAAAGCC 2 2 50 
ACTTTTAGAT 2 3 00 
TTTTGACATC 23 50 
AAAAAGTTTG 24 0 0 
ATGGATTAAT 24 5 0 
GATAATAAAT 2 500 
AAGACTCTCT 2550 
TTAAGGAAAA 2600 
AGACTCTGTT 26 50 
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AGGTGACATG 
CAGGATATTT 
AATAATAGTA 
TAGATCTCTT 
GCAGCCAGCA 
GAGCCATTGT 
GACTCAGCCT 
GCACATGGTT 
AGGGTTAAAA 
ATCATTTCTT 
GCAGAAAACA 
TGTGAATCCA 
ATGCATTAGA 
AAATATATAC 
AGTATTGCAA 
AAACCTATCT 
ACTGAATAAT 
TTAAAGGTCT 
TAAGAAACAT 
TCCGTTGGGG 
TCTTTAAAAA 
ATATGGAAAC 
AATGTTTACC 
CATTATTTAA 
GTAATAATGT 
ACACACATGC 
GGAGCTTTGG 
TGGAGCAGTA 
ACATTTATTA 
TAAGGAAGTA 
GAGAATATTT 
CCTTTAGTTT 
TTAGTGTTCA 
ACAGAGCACA 
GTGTGGAAGC 
AATTAGTGAA 
TAAAAAAGGG 
AATAGCGCCA 
ATATACACAC 
TACATGGAGG 
ATTTTCATGT 
ACACTTCAGA 
TAGTGGTCAC 
ACTCGCTTAT 
TGGCTTTTTT 
ATGATTAAAA 
AATCATTATC 
AAAAAAAAAA 
TTTGGGCCTC 
TATGTACAAG 
GGCACTGTAA 
ATCGATCACT 
TTTTTAAAAC 
CTTTTTTATT 
TGTTGCTTGT 



GCATTTCGTG 
TGTCGTGTTT 
ATAGTGTATG 
TAGTCTTTTC 
ATTTATTTTC 
GGAACTGCAG 
CTGTGGGTGA 
ACATTTGGGA 
ACTGGACTTT 
GCTGAATTTG 
AAAACAACAG 
TCTTTCCTTG 
AAACTACCTG 
AGTAAAATCC 
ATAAGTTGAG 
TGGTTAACAT 
TATAGGCTTT 
AATTAATATA 
ATCTATTAAG 
ATGGGGGCAA 
AAAACAGA-AA 
TTCTTTCCTT 
AAGAATTTCT 
AATATTTGGG 
AAATGGGGGG 
ACACATACCG 
CTCCTGATAT 
GTTTTTATAC 
AAAAAGTGTG 
CATTTTTAAA 
TTTCTGAAAG 
CTATGAAACA 
ATATTGCTTT 
AATAGAAAAC 
ACTACTTTAT 
CTGAGACATT 
AGTACCAGGA 
TATATGTATG 
ACACATACAT 
CACATCTTCA 
GTACACCTCT 
GTAAGAGGGA 
TAAACCCTTT 
TTGAATTGCA 
TTCCCTAGAA 
ATAAGTATTG 
CTTGAAATTT 
AAAAAAAAAG 
AGACAAGATA 
TTAGGTCACC 
GCAATGCTAT 
GTTAAAACTT 
TTCAATTGTC 
TCATTTAGTG 
GCTTTATTTT 



TCACTTAATA 
CATCCTTCTT 
GCATTGGGGT 
CACCTTTCAC 
ATTGTTGGCC 
CCATGAGGTG 
CAGACTTCTA 
ACTTGTGTAG 
TTAGAAGTAA 
ATATGTTTTT 
TCCTACTCCG 
CAATATTTTG 
ATGAACTACC 
CACTTTCATG 
TTTGTAGAGG 
GAAAATAACA 
TCCTCACATT 
TTTTTCTAAG 
CACTTGTTAA 
GGGTGGGAGG 



i^7\7\A'nT\nTA i-nrn 



TTTGTAGGGC 
AAAAATGAGT 
AGTAGTATAA 
CTTCAATATA 
CACTTCATAG 
TTTATCATGC 
TTATGTATTT 
TTCCAAAGGC 
TTTTTCAAAC 
TAGGCTTTTC 
GATTGCTTAC 
AATCACCATA 
CTATTTTTAA 
TCTGTTTAAA 
GGCCTTAGTA 
TTTATTAAGT 
TATGTGTATG 
ACTTAAATCT 
GGGCACCAGT 
TTGCCTGTTC 
ATTCAGCTAA 
TTGAGAGAAT 
CAATGTTCTA 
AAAGATTGTT 
CCAATGCTGT 
CTGGTAGTGC 
GGATTAACAT 
TTGAACCTCA 
AAACACGGAA 
CCATTGATGT 
TCATTTTAAA 
CTGGCTGATT 
TTTCTTTCAA 
TCTTAGTCAT 



GTTGGCCCTA 
AACATGCTAT 
CTTCAGAGTC 
ATCCAAGGGG 
CACGGTTAGT 
TGTTTATCCC 
AGCAGGAAGA 
GGATCATGGC 
AGTAAAAGCA 
CTTTCCCTTA 
ATGTTATCTT 
GATGGAGAAT 
ACAAAGTTTT 
CATCTCTGGG 
GTAACAAAGT 
ATTGAGAATA 
AGACAACCAA 
GGTCAGTTGG 
CACCTTATTT 
TTTTTAGAAG 
TCTGAGCACT 
CAGTTATCAC 
GC AGATTACT 
TTTGTTGAGA 
TATATATAAT 
AATCAAAGTT 
TCCTATATTT 
AAATTTTATT 
ATTAAAATTA 
TGCTCCTAGC 
GCTCTGCTTC 
CTAAATCTTT 
TAAAAGGAAA 
ATAGAAATCA 
ATGTACTTAA 
GGCTGTATTC 
AAAGCATTTT 
TGTGTGTGTG 
TGCCCTGCAT 
GTTAAAATTT 
CCACCCCCAG 
TTTGTTTTTA 
TTCTATTAAA 
ACAAGGATGT 
TGTTTCTATG 
TTTCATTCTC 
CTTAGCTTGG 
TAAATAAAAG 
TTCAGTTTCA 
GTTGAGTGTG 
ATACAAGTAC 
ATCCTATTAC 
ATGCATCACT 
GCTGTGTATT 
TTGTGGAATA 



AATTAGTACA 
CT7TTCATTT 
GATATATAGG 
TGGGTCAAGT 
CCATAATCTA 
ACAGTGGATT 
TAGACGTGAA 
CCCTGTAGCC 
TAKCGCTTAT 
AGAATCAAAA 
TCTGATTCAA 
TTGAAGTTAA 
AAGTGACTAG 
AAATGATAGG 
AAAGTAAAAC 
TATTATATTC 
CATAATCTTC 
GACATTAACC 
TAGGACCCTT 
AGTATATATC 
CATTAGCCCT 
TGCAGATTGC 
GAATATAATA 
AATGTAAATT 
ACACACACAC 
GCTCTCTGAA 
TTTTAATCCT 
ATGAAAAATT 
TATATGTTAA 
TTTTGATTAG 
ATTACTGCTT 
AGTTGAATGA 
AAAATTGGTG 
CAAATAGCAA 
GAAGTCATCA 
ACTGCTAATT 
GGAAATGGGG 
GTGTGTGTAT 
GAAATTCAAA 
TGGAGTCTTA 
ACTTGAAATA 
AAATTGACTG 
GATGAGGCAG 
AACACAGAAT 
TCAACTAGAT 
TAGTGGCCAG 
TTAAAAAAAA 
TAGTTTAGAA 
CTTCCACATG 
GAAGGATCTT 
CTTTATAGTT 
CAAGTTCAGT 
CTGTGTGCAA 
TTTGCCTATT 
TAGTGATATA 
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FIG.^A (continued) 

TTGTGTTAAT TTGGACAGTA GCGGTTTTTA AAAACCATAT ACTGACTGAA 54 50 
ACATGAGCCA GAGCCGATTG CTTTATTAAG CTAATAATGA ATGTTAAAGA 5500 
GTACATATTT TCAGGATCGT TCATCTAGTG AGCAATACAC ATATTATAGG 5550 
CCAATATTTT TTTAAAAAAT AGAGCTTGGT CAACCTCTAT ACTACACATA 5600 
TTACAAGATA TAGCACTTTC AAAATGAATC TAAACCTTTA CAGAAACTTT 56 50 
CTTATAGGTT ATGCCTTTTA TTTTAAGACT TATTATAATT CAAGTGCCAT 5700 
TAGATGATAT ATATGTAGGC CTTTGATATA TAATGCTTTG TGTACAAAAA 5750 
TGGTAGATGG TATTTTAAAC AGGTACATTT TTACAGTGTT TTCTTATCAA 58 0 0 
TTTGCTATAT TGCACAGAAT CAGTGTGTGT CTTTTCATAA GGTTTTACAA 5 8 50 
TGGTTTATTT TTTTACAAGG TTTACGTGTC TCAAAGCACA CTGTCTTCCC 5900 
AGTACGTAAG TTAAAAAATA CCAGTTCACC CAAGTTGCTT CTAGCCTACT 5 9 50 
GAGATCCATG TGACATTGGA GGAGATCTTT TAAATGTTTA GTATTCGTCA 60 00 
TTAGCAATGG CTGGCTGTTA GTTCTGGTAA ATGTGTGCCT AAGTTGAATT 6 050 
TGTCTTGTTT TTCTCACACT GTGTCAGCAG CCATGTCTAC AACACAGATA 6100 
AGTCTGTTGT GATCACATAG ATCTACATAA GTTGTGCAGT TTTGTGCTAA 615 0 
AAACCCATAG GGAGCTCCTT TGGGATCATA GAAAAGAAGA TCATGCAACC 620 0 
AGCATTGGTG AAGGCACACT CAGATTGCAC TTAGGGCCTT TCTATGATGT 62 5 0 
TGTCAACCCT CTGAGGATGG AAGGCAGTGT CTTTTGATGT TATCTAGCCT 6 3 00 
AGAAATGACA CAGAACTATT GCTAATGTAT AAAACACTTC ATTATATAAG 6 3 50 
CTTCAGTGGT ACAGATGAAC CAGAATGAAT GTTTATCTTC TCAGAAACAC 64 00 
TCCTTCAATA TTATATTGGA TCATGCTGCT AATGTAACTT GGGCTACAAC 6 4 50 
TCTTCATGGT GCTACAAACT TCTCTGTCTC ATTCAGTCGT ATTTTTTTAT 6 500 
CCATAGAAAA AGGACTACAT TAGGTGTAAA AGTGTACAAT ATATTTTTAT 6 5 50 
ACTGTGACTT AATTTGTCAT TAACAAACTT TTACACCACC ACAATGTATT 6 6 00 
CATGTGCACT TGCAAAAGGA GATCTCGGAC ATGCAAATGT TACCAGAACA 6 6 50 
AACCCAGCTT TTGTCCACAA GGTGACTGTA ACTCAGAATG GAAAGTGGGC 6 700 
TTTATAATAG GGTGTGGAGT GAAGAACATG CTGTATGTTA CTAACAGCCC 6 7 50 
TTTGAATTTA ACAAAAACTG GGAATCCATT AGGAAACGGA TTGCATCATA 6 8 00 
CCTGAACATA AGCTGGACTG CTGAAATTGT ATTTTTAGCT AATGAAAAAG 6 8 50 
TGTTTGGACT AGTACTCTAA AAATGTTCTA ATGATAAAGT TTTGAGTCAA 6 900 
AATAGAAAAG AAAAAAATCT GCATTCCAGG CCGAATTTTG TATATTTTTA 6 950 
TTGCATTtAA AATTGCTATT C7GTAATATT GGGAAATCAA GTGGCTTATC 70 00 
ATGTATATCG TGTACTTAAA ATGTATTCAC AAACTACTGT TGTATTTGTA 70 5 0 
TAAAATATAG ACAAAGATCA TATTTTTTGT GTGTGTATAA GCTCTGTAAA 7100 
ATAGCAATCA CATTATGAAG CTGCAGTGAT ACTACATTTT AAACATTCAC 7150 
ATCCAAAGAA GCAGACTATT TATTGTCCAT ATACCAGATT TAAAATATTA 72 00 
ATTTGCTGCT AATTAAATAA TAGTACTGCA GCTTCTTGTG GCCTACAGTG 72 5 0 
TTATGTTTGC TGTAAGAATA AGATATGTGA ATTCCACAAA ATATATGAAT 7 3 00 
AAAATCTCGT GCC 7313 
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FIG. 10 

STSs used to generate the 300 kb cosmid contig mapping 
at chromosome 8q1 2 and encompassing PIAG1 



STS CHI 29 

GAATTCTAAAACCATTTATAAATCATACTGAATCCCAGAACAATATATTTTAAACAACTTAA 

AAAAAAGAACAAAATAAAATAGCAAAACATTTTTAAGAGTGTAGATTCTTTGAAATTAAAGG 
ACATACTTACCCTGTAGT 



STS CH280 

GAATTCTTGCACCGGTTTTTTCTTATCAGTGTGGGCTGATGTTCCATTAACTGTGGTGTAAT 

TTGAGTATAGTCACTGACTGATTCTAGATATTTTCAGAGGGTCAAGACTTTTTCTAAGACCT 

TTATATGTGGTTGAATTCTTGTTCTTGGTTTCACAGAAGGTATATTAGCAAAGCATTTTTGG 
TGTTGAAGCTTGGTCTGTGATCTAGT 



STS CH33 

GAATTCGTTTTTATTTGACAAGCACATGAAGCCTTATCAGACGGAGGCCTCAATCCTTTGGC 
TGGGGTTTATAAGCAGGTAGCGCTAGACCTTCCCATTCTACATAAGCTGATGGGCACGGTAA 
TAGCTGGGGGTTTTCTCACAAGTCAAAGACAAATTGTCTGTTTTCAAGCGTGTGAAACAGTT 
wAAWACGTTTGAGGTCTCTCTCTTGCTTCATAGGCCATCTTGGCTCAGACATTCTACAGMCA 



STS EMI 56 

TCTGAGCAACAAGAGCGAAACTCCATCTCAAAATATATATATATATAGGTAATTGTTGTCAT 

TAATATTAATGTAGTAGCAGCAGCAACAGTCATGGTAGCAATATTGCTCTATTTGGGAGGCA 

ACTTATAATTATTAACTGTGGAATATCTTTGAAAAATGTTTTTNGCAGAMGTTATGTTCCCA 

TTCCTGACTGGMGCTCATTATAAATACCCATCTTCTCTGAATAGCGCAAGGACTTTTGAAAA 
AGTGTTCTGAGTAAAC 



STS EMI 95 

ACAATCAATTTTAGAAAGTAATCATTTCATTACCCCAAACTGAAACCCTGTACCTGTTAGCA 
CTCACTCCCCTTTTCATTTTACTTTTTATTTATTTATTTTTTTGAGAGAGACTTGCTCTATC 
GCCCNGGCNVCAGTGCAGTGGCACAaTCTCAACTCACTGCAACCTCTGCCTGCCAGGGTCAA 
GTGATTCTTGTGCCTCAGAGTCCCAAGTACCTGGGATTACAGGCATAAGCCACCACGCCTGG 
CTAAATTTTGTATTTTCAGTAGTGACGGGGTTTCACCATGTTGGCCAGGCTGTCTCAAACTG 
CTGACCTCAGGTAATCCACCCTCCTCAGCCTCCCAGAGTTCTGGGATTACAGGCGTGACACC 
GTGCCTGGCTCATTTTATTTTTTAGAGATGAGATCTCACTCTKWTTGCCCAGGCTTCAGTGC 
ATTGGCGTCATGATGGCTCACTGCAGGCTTCAGCTCCTGGCCTCAAaGCATCCTTCCGCCTCA 



STS EM208 

CTAGGCGACAGAGCAAGACTCTGTCTCAARGAAAAAaAAAAAaVRAAAAAAATTACCAAAAC 

TGACTACAGAAAAVHGVARGGTTGAATAGCCTTACATTTGGVAAATAATTTTTATTTATAAT 

TAAAGATATTTTTATAAAAWTACTCTAGGCCCATAAGGCTTCACAGGTTAATTGTATTAAA 

TAmAAGGAAAAAATAATACCAATCTTATTCATAGTCTTTCAGAAAATAGAGGCGTATCCA 

TTTTTCTAACTCATTTTAAGAAATAGCATCATTCTAATATCAAAAGCAAACAAGGMCATTGC 

AAAGAAGAAGGGGAAGAAGGAAGAGGAAGAGGAGGAGAAAGGGAAGCAGGAGATGGAGAAGA 

AGGAAGCCAGGTACAGTGCAATATTTCTCATGAACATAAACACAATTTTTAAAAAGTATTAM 

CAGGCTGGGCTTGGTGGCTCTCACCCGTAATCCCAGCMCTTTGGAAGGCCcAAGGCGGGTGG 
GHCACAAGGTCAGGGGTTCGAG 
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FIG. 10 (continued) 



STS EM216 

GGGKCTATTTAATAGGTTTCCTAAAGAACAAACATATCTCTTTAbAGTTACTCAGAGGGTAC 

ACAATGATGATGTCACACAATTAATTACCTATTAAGACTGAAATCCAGCAATGCA^^ 

TGGACTTTACGCACATCCAGAAAAAGTTCTAGCACAAATTGTmHGTM^ 

aagccatagaaacactattaaagccctccctaatcacttISSgS^c^^ 

STS EM317 
STS EM416 

?'^2S™^S^^SS""^^"'^^^^'^^^^^"^"aagatgtgggctgcacaggtgaggctggagaggt 

STS EM443 

^™S5I^29T^g?aggtaccctaacagcaatgggtcagcttttgagtagcgtttcaaccat 

tcgagc 



JIJSS^T^^agtacggtgtggtgaggccagacgcagatggagagaaagaaacagaa' 



ATAACTTCAGGGGGgCTCTAGGGCCTGgRGGCTGCCCCTGGlTGT^^^^^^ 
STS EM46 

STS EM47 

mm™°^^^^*^'^^^^^^'^^^ACAGACCAGTGCAGAATGAACAATATTTGTTACATGTGTAr 

ttctttatggtttacaaaactctcccagccattatcttctttcagccSSaaS 

CATATTTTATTATCCTCATTTACCTWHTCTAGTAAGG?SS5^^ 
?JtIfJ^^^^*rTAGGAAAAAAGTGAATACTACGATAAATGSTACT£GS^ 

atcacaaatttattaatatcagaaaacagdttttaagaat^Saat^^ 
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FIG. 10 (continued) 



STS END2 

TAATTTATCACTACGGAATTCTGTGCAGTGAGATCAAAGAGCTGTGTATGCCCATAATGTGA 
TTTTACAGCCATTTTGTAAAAACTGTAAAATACCTTAATATTCAATTTGGCTTAAGGTACAT 
TGAGGACTTCTGGTTGAAAATTACAGAGTGGTGAAGATTC 

Known STSs 
PENK 
D8S285 
MOS 



STSs part of PLAG1 

EM265 

KK64 

KK63/EM209 
KK55/CH283 
EM224 
EM387 
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